Skip to content

Multi-Agent Systems

Patterns for designing, coordinating, and operating systems where multiple agents collaborate — from topology selection through fan-out parallelism to production deployment.

Topology & Architecture

Choosing the right structure for agent collaboration determines failure modes, latency, and coordination overhead.

Fan-Out & Parallelism

Strategies for splitting work across parallel agents — and controlling the blast radius of concurrent execution.

  • Cohesion-Aware Task Partitioning for Multi-Agent Coding — Partition by dependency cohesion before fanning out — parallelism only pays off when cross-partition dependencies are sparse enough that context-transfer cost stays below the compute saved on the critical path
  • Fan-Out Synthesis Pattern — Spawn N independent agents to solve the same problem in parallel, then use a synthesis agent to merge the strongest elements from each attempt
  • Recursive Best-of-N Delegation — Run K parallel candidate workers at each recursion node and select the best result via a judge before the parent consumes it — preventing error compounding in recursive agent trees
  • Recursive Sub-Agent Delegation: Depth Limits and Trade-offs in Nested Hierarchies — Depth is a design lever distinct from fan-out width; each level buys one isolated context window at the cost of compounding tokens, latency, summarisation loss, and tracing burden
  • Contextual Capability Calibration for Multi-Agent Delegation — Replace static skill-level agent profiles with context-specific Beta posteriors so routing decisions condition on the task features that actually predict success
  • Parsimonious Agent Routing — A learned router emits one delegation plan — keep, single-route, or split-and-route — with per-branch budget, jointly optimizing decisions that hand-engineered pipelines treat as independent
  • Sub-Agents for Fan-Out Research and Context Isolation — Spawn sub-agents to parallelize independent work in isolated context windows; the main thread receives only distilled results
  • Adaptive Sandbox Fan-Out Controller — Start with a small parallel batch, monitor quality signals, then scale up, stop early, refine the prompt, or decompose — rather than committing to a fixed N upfront
  • Swarm Migration Pattern — Coordinate 10–20 parallel subagents to migrate large codebases atomically, achieving 6–10x speedup for qualifying file-independent transformations
  • Bounded Batch Dispatch — Process large agent workloads without hitting API rate limits by dispatching work in sequential batches of fixed size
  • Staggered Agent Launch — Launch parallel agents 30 seconds apart to break the thundering-herd dynamic so each agent claims work before the next reads the queue
  • Async Non-Blocking Subagent Dispatch — Decouple the orchestrator's processing loop from subagent lifecycle so it continues planning and processing partial results while delegates execute concurrently
  • LLM Map-Reduce Pattern — Split a large input into context-window-sized chunks, process each chunk independently, then combine chunk-level results into a coherent output

Coordination

How agents hand off work, share state, and refine each other's output without a centralized controller.

Multi-Model

Patterns that leverage multiple distinct models — using diversity of reasoning to strengthen outputs.

Operational

Deploying, observing, and optimizing multi-agent systems in production.

  • Rainbow Deployments for Agents — Shift traffic between agent versions gradually so new versions prove themselves alongside old ones before full cutover
  • Emergent Behavior Sensitivity — Small changes to a lead agent's prompt unpredictably alter subagent behavior; multi-agent prompts must be frameworks for collaboration, not rigid instructions
  • Semantic Caching for Multi-Agent Code Systems — Semantic caching with LLM-based equivalence detection achieves 67% cache hit rates and reduces token consumption by 40-60%
  • Subagent Schema-Level Tool Filtering — Restrict subagent capabilities by filtering their tool schemas, making unauthorized tool use structurally impossible
  • Cross-Tool Subagent Comparison — Side-by-side comparison of Claude Code, Gemini CLI, and Copilot CLI subagents on definition format, context isolation, tool scoping, and composition
Feedback