Skip to content

Effort

Effort controls how hard AIBA pushes. You pick it at launch (Step 3) — it determines the model's behavior, the resource budget, and the depth of investigation. Same mode, same template. Three different intensities.


There are three effort levels in AIBA: quick, balanced, and max


quick

Fast. Cheap. Shallow. Built for answers you need in seconds, not minutes.

The agent uses minimal tool calls, skips pagination, and avoids the browser unless a page absolutely requires it. One or two sources are enough — get the answer and move on.

In swarm mode, the orchestrator plans 1–2 waves max, then synthesizes immediately. No looping. No chasing leads.

Best for: Quick lookups. Fact-checking. "What is X?" questions. Testing a prompt before running it deeper.


balanced

Thorough but pragmatic. The daily driver.

The agent cross-checks facts across 2–3 sources, uses the browser for dynamic content, and paginates where it matters — but doesn't exhaust every page. It delivers structured, well-organized results without burning through your quota.

In swarm mode, the orchestrator plans 2–3 waves. After wave 2 it asks: "Do I have enough to answer well?" If yes, it synthesizes. No chasing diminishing returns.

Best for: Most things. Research tasks. Multi-source verification. The default you'll use 80% of the time.


max

Exhaustive. Every tool. Every page. Every lead.

The agent leaves no stone unturned. It stacks browser navigation with visual screenshots, JavaScript evaluation, multi-hop navigation, and full pagination exhaustion. It cross-verifies across every available source. In max mode only, Gemini gets extra reasoning compute — it thinks harder before acting.

In swarm mode, the orchestrator plans 4–8 waves across multiple vectors. It chases pivot leads recursively. After wave 5 it shifts toward synthesis to ensure a deliverable ships before the budget runs out.

Best for: Deep-dive investigations. Competitive analysis. OSINT dossiers. Anything where completeness matters more than cost.


What Effort Changes

Dimension quick balanced max
Temperature 0.3 (deterministic) 0.5 (flexible) 0.7 (creative)
Max tokens / response 4,096 8,192 16,384
Request limit 15 25 50
Tool call limit 20 40 100
Total token budget 100K 300K 500K
Timeout 30s 60s 120s
Thinking (Gemini) Off Off High

Effort Across Modes

Effort applies to both execution modes, but it means something slightly different in each.

In Agent mode: The sub-agent receives behavioral instructions matching your effort — concise for quick, thorough for balanced, exhaustive for max.

In Swarm mode: Both layers get effort-specific guidance. The orchestrator gets wave-planning instructions (1–2, 2–3, or 4–8 waves). Every spawned sub-agent gets matching depth instructions (minimal tools, cross-check, or full arsenal).

Same effort, same budget — applied to every agent in the system.


How to Choose

Pick quick when:

  • You need a fast answer — fact, date, definition
  • You're testing a prompt before committing more resources
  • Token cost matters

Pick balanced when:

  • You're doing real research — not just a lookup
  • You want cross-verified results without overkill
  • You're running job_search for a focused search

Pick max when:

  • Completeness is non-negotiable
  • You're running osint or a deep competitive analysis
  • The task branches — each finding opens new leads
  • You're running in swarm mode and want full pivot detection