Skip to content

Anthropic's Effective Agents Framework: A Pattern Map

Anthropic's Building Effective Agents framework names three building blocks — augmented LLM, five workflow patterns, autonomous agent loop — each conditional on its own assumptions.

Overview

Anthropic's December 2024 engineering post and the expanded 'Building Effective AI Agents' eBook define the most-cited taxonomy in agent-pattern literature (Anthropic: Building Effective Agents; eBook landing). This page maps that taxonomy to site pages and the conditions under which each layer holds.

Anthropic names three layers:

  • Augmented LLM — an LLM plus retrieval, tools, and memory
  • Workflows — five patterns with predefined control flow, built on the augmented LLM
  • Agents — the autonomous loop, where the LLM directs its own control flow

Anthropic's central advice is to start with simple prompts. Add agentic systems only when simpler solutions fall short.

The augmented LLM

The base unit is an LLM extended with three capabilities (Anthropic blog):

  • Retrieval — the model writes its own search queries and reads the results
  • Tools — the model picks and runs tools, then reads the output
  • Memory — the model decides what to keep across turns

Anthropic recommends tailoring each capability behind a well-documented interface. It cites the Model Context Protocol as one way to connect them. See Agent Memory Patterns, Tool Engineering, and Externalization in LLM Agents.

The five workflow patterns

Anthropic pattern What it does Site page
Prompt chaining Decompose a task into sequential LLM calls with programmatic checkpoints between steps Prompt Chaining
Routing Classify input and dispatch to a specialized downstream prompt Parsimonious Agent Routing
Parallelization Run independent LLM calls concurrently (sectioning) or repeat the same call for consensus (voting) Fan-Out Synthesis
Orchestrator-workers A central LLM decomposes the task at runtime and dispatches dynamic subtasks to worker LLMs Orchestrator-Worker
Evaluator-optimizer A generator produces output; a separate evaluator critiques it; the loop continues until a quality threshold passes Evaluator-Optimizer

All five share one contract: control flow is predefined in code, not produced by the LLM. That is the dividing line between workflow and agent in Anthropic's taxonomy.

The autonomous agent loop

An agent receives a task, plans on its own, takes tool actions, observes results, and either asks for human input or continues until a stopping condition fires. The LLM owns control flow (Anthropic blog). Anthropic recommends agents for open-ended problems where you cannot predict the step count. See Goal-Driven Autonomous Loop, Loop Strategy Spectrum, and Multi-Agent Topology Taxonomy.

Workflows vs agents: the decision

Anthropic's decision criteria (Anthropic blog):

Dimension Workflows Agents
Best for Fixed-step tasks Open-ended tasks with unpredictable paths
Predictability High — explicit code paths Lower — LLM directs control flow
Cost / latency Lower per invocation Higher; compounding error rates per extra LLM call
Debuggability Easier — failure localizes to a step Harder — requires sandboxed testing and guardrails

The site treats this as a spectrum, not a binary. See Agentless vs Autonomous for the case that simple two-phase workflows often beat autonomous agents, and The Delegation Decision for matching task characteristics to delegation depth.

What the eBook adds beyond the blog

The expanded eBook covers material the December 2024 post does not (eBook landing):

Why it works

The framework's value is taxonomic, not algorithmic. It names five patterns with distinct cost, predictability, and debuggability profiles. That turns "build an agent" into a discrete choice: which control-flow shape — sequence, branch, parallel, dispatch, refine — fits this problem? Those shapes are classical CS primitives, and the contribution is matching shape to task before you write code.

Every extra LLM call adds latency, cost, and a compounding error rate. Workflows beat agents whenever a task's structure is stable enough to encode in code, because they pay the inference cost only at decision points the developer chose (Anthropic blog).

When this backfires

The patterns assume the task is well-specified and the output is verifiable. In production, those assumptions fail in several settings (Towards AI: Beyond Anthropic's Playbook):

  • High-frequency, low-complexity tasks — deterministic code beats both workflows and agents on cost and latency (aimultiple: Building AI Agents)
  • No clear evaluation criterion — evaluator-optimizer becomes circular when the evaluator cannot reliably tell good output from bad
  • High-stakes one-shot decisions — the framework's autonomous-loop guidance under-specifies adversarial verification and external grounding
  • Retrieval-bottlenecked tasks — when RAG correctness dominates outcome quality, workflow-pattern choice is a second-order concern
  • Teams with existing frameworks — for teams without infra, LangGraph or CrewAI encode these patterns at lower upfront cost than re-implementation
  • Definitional drift — independent reviewers note that the post's definitions of "agentic", "workflows", and "agents" are not internally consistent (thoughtsfromthedatafront analysis)
  • Parallel sub-agents sharing implicit context — when parallelization or orchestrator-workers is pushed into autonomous multi-agent territory, Cognition argues the pattern turns fragile: concurrent agents that lack each other's implicit design decisions make conflicting choices that do not compose (Cognition: Don't Build Multi-Agents). Their follow-up narrows the safe envelope to topologies where multiple agents contribute intelligence but writes stay single-threaded (Cognition: Multi-Agents — What's Actually Working), the opposite of the dynamic-dispatch framing the original post leaves open

The site's Anti-Patterns section catalogues specific failure modes for several of these workflow shapes.

Key Takeaways

  • The augmented LLM (retrieval + tools + memory) is the substrate; the five workflow patterns and the autonomous loop are built on it
  • Workflows have predefined control flow in code; agents put the LLM in charge of control flow — that boundary determines cost, predictability, and debuggability
  • Start with the simplest viable approach. The marginal-cost threshold for adding a pattern is one extra LLM call's latency and error rate
  • The framework's patterns are taxonomic, not algorithmic — they name shape choices a developer must still match to task structure
  • Apply the framework with conditions: each pattern fails predictably in specific settings that the original post does not enumerate
Feedback