Multi-Agent Systems¶

Patterns for designing, coordinating, and operating systems where multiple agents collaborate — from topology selection through fan-out parallelism to production deployment.

Topology & Architecture¶

Choosing the right structure for agent collaboration determines failure modes, latency, and coordination overhead.

Multi-Agent Topology Taxonomy: Centralised, Decentralised, and Hybrid — Choosing the wrong coordination topology for a task type is a primary source of production agent failures; each topology carries distinct failure modes
Multi-Agent SE Design Patterns: A Taxonomy Across 94 Papers — A systematic study of 94 LLM-based multi-agent SE papers identifies 16 design patterns, with Role-Based Cooperation as the dominant pattern
Orchestrator-Worker Pattern — A lead agent decomposes a complex task and assigns independent subtasks to specialized workers running in parallel
Magentic Orchestration: Task-Ledger-Driven Adaptive Multi-Agent Planning — A manager agent maintains a task ledger of facts, guesses, and plan, dispatches specialists, and revises the plan when progress stalls — fit only when the plan itself is the unknown
Specialist Orchestrated Queuing for Multi-Agent SE (SPOQ) — Composes wave-based topological dispatch, dual validation gates, a tiered Opus/Sonnet/Haiku roster, and Human-as-an-Agent — gains hold only when four upstream conditions are met
System-Level Optimization Pipeline — A four-stage agent pipeline decomposes performance engineering into summarization, analysis, optimization, and verification across component boundaries
Oracle-Based Task Decomposition — Introduce a reference oracle to generate per-unit expected outputs, converting one monolithic task into hundreds of independently verifiable subtasks
Reverse-Engineered Executable Specifications for Agentic Program Repair — Decompose automated program repair into specification inference then constrained patching, making the intended behaviour an inspectable intermediate artefact before any code is changed
Declarative Multi-Agent Composition — Define agents and workflows as structured data, then compose them through explicit wiring rather than imperative code
Declarative Multi-Agent Topology: Topology-as-Code — Encode an entire agent graph in a single declarative file that compiles to any target runtime, making topology auditable and portable

Fan-Out & Parallelism¶

Strategies for splitting work across parallel agents — and controlling the blast radius of concurrent execution.

Cohesion-Aware Task Partitioning for Multi-Agent Coding — Partition by dependency cohesion before fanning out — parallelism only pays off when cross-partition dependencies are sparse enough that context-transfer cost stays below the compute saved on the critical path
Fan-Out Synthesis Pattern — Spawn N independent agents to solve the same problem in parallel, then use a synthesis agent to merge the strongest elements from each attempt
Recursive Best-of-N Delegation — Run K parallel candidate workers at each recursion node and select the best result via a judge before the parent consumes it — preventing error compounding in recursive agent trees
Recursive Sub-Agent Delegation: Depth Limits and Trade-offs in Nested Hierarchies — Depth is a design lever distinct from fan-out width; each level buys one isolated context window at the cost of compounding tokens, latency, summarisation loss, and tracing burden
Contextual Capability Calibration for Multi-Agent Delegation — Replace static skill-level agent profiles with context-specific Beta posteriors so routing decisions condition on the task features that actually predict success
Parsimonious Agent Routing — A learned router emits one delegation plan — keep, single-route, or split-and-route — with per-branch budget, jointly optimizing decisions that hand-engineered pipelines treat as independent
Sub-Agents for Fan-Out Research and Context Isolation — Spawn sub-agents to parallelize independent work in isolated context windows; the main thread receives only distilled results
Adaptive Sandbox Fan-Out Controller — Start with a small parallel batch, monitor quality signals, then scale up, stop early, refine the prompt, or decompose — rather than committing to a fixed N upfront
Swarm Migration Pattern — Coordinate 10–20 parallel subagents to migrate large codebases atomically, achieving 6–10x speedup for qualifying file-independent transformations
Bounded Batch Dispatch — Process large agent workloads without hitting API rate limits by dispatching work in sequential batches of fixed size
Staggered Agent Launch — Launch parallel agents 30 seconds apart to break the thundering-herd dynamic so each agent claims work before the next reads the queue
Async Non-Blocking Subagent Dispatch — Decouple the orchestrator's processing loop from subagent lifecycle so it continues planning and processing partial results while delegates execute concurrently
LLM Map-Reduce Pattern — Split a large input into context-window-sized chunks, process each chunk independently, then combine chunk-level results into a coherent output

Coordination¶

How agents hand off work, share state, and refine each other's output without a centralized controller.

Economic Value Signaling in Multi-Agent Networks — Attach token values to inter-agent messages so agents self-sort by task priority without a central scheduler
Agent Handoff Protocols: Passing Work Between Agents — Define explicit contracts between pipeline stages to prevent information loss at handoff points
Lead-to-Teammate Plan-Approval Handshake — A teammate runs in read-only plan mode until the lead approves its plan, gating writes on peer review at the cheapest point in the cycle
Forked vs Fresh Subagents: When to Inherit the Parent Conversation — Fork when the parent's mental model is an asset the brief cannot reconstruct; start fresh when bias, trifecta exposure, or token budget make inherited context a liability
File-Based Agent Coordination — Coordinate parallel agents using lightweight file locks in a shared repository; git merge mechanics enforce task exclusivity without a central orchestrator
Observation-Driven Coordination: CRDT-Based Parallel Agent Code Generation — CRDT-based shared state enables lock-free concurrent code generation with zero structural merge conflicts
Closed-Loop Role-Based Refinement — Decompose the self-improving agent loop into discrete, specialized roles with persistent knowledge layers, staged validation, and gated persistence
Independent Test Generation in Multi-Agent Code Systems — Separate code and test generation into independent agents so the test writer never sees the code, preventing bias that cuts test accuracy by 30%
Persistent Shared Search Sub-Agent for Output-Token Reuse — Route repository lookups through one persistent searcher so a region is explored once, cutting the redundant output tokens that dominate multi-agent cost
Context-Graph Shared Memory for Multi-Agent Systems — Store cross-agent shared state as typed triples and traverse edges for join queries — beats vector RAG only when queries chain facts and entities are clean

Multi-Model¶

Patterns that leverage multiple distinct models — using diversity of reasoning to strengthen outputs.

Adversarial Multi-Model Development Pipeline (VSDD) — A six-phase pipeline where a fresh-context adversary attacks builder output until convergence, combining spec-driven development, TDD, and formal verification
Multi-Model Plan Synthesis — Get independent plans from multiple frontier models, then synthesize a hybrid architecture from the strongest ideas of each before writing code
Voting / Ensemble Pattern — Run the same task N times in parallel, then aggregate results through voting to trade compute for confidence
Opponent Processor / Multi-Agent Debate — Deploy two agents with structurally opposed incentives to independently critique each other's reasoning, then synthesize into a higher-quality decision

Operational¶

Deploying, observing, and optimizing multi-agent systems in production.

Rainbow Deployments for Agents — Shift traffic between agent versions gradually so new versions prove themselves alongside old ones before full cutover
Emergent Behavior Sensitivity — Small changes to a lead agent's prompt unpredictably alter subagent behavior; multi-agent prompts must be frameworks for collaboration, not rigid instructions
Semantic Caching for Multi-Agent Code Systems — Semantic caching with LLM-based equivalence detection achieves 67% cache hit rates and reduces token consumption by 40-60%
Subagent Schema-Level Tool Filtering — Restrict subagent capabilities by filtering their tool schemas, making unauthorized tool use structurally impossible
Cross-Tool Subagent Comparison — Side-by-side comparison of Claude Code, Gemini CLI, and Copilot CLI subagents on definition format, context isolation, tool scoping, and composition