Continual Learning for AI Agents: Three Layers of Knowledge Accumulation

AI agents accumulate knowledge at three layers — model, harness, and context — and routing a fix to the wrong layer wastes effort.

The three layers

LangChain's analysis defines three update targets in any agentic system:

Layer	What it covers	Update mechanism
Model	Neural network weights	Fine-tuning (SFT, RL)
Harness	Scaffold code plus instructions and tools that are always present	Code changes, prompt rewrites
Context	Instructions, skills, and memory that live outside the harness and configure it per agent/user/org	File edits, memory writes

These are independent update targets, not a hierarchy. A context-layer failure does not require a model update, and the reverse holds too.

Model layer

Model-layer learning updates the weights themselves, through supervised fine-tuning (SFT) or reinforcement learning methods such as GRPO.

The central challenge is catastrophic forgetting: new training degrades performance on tasks the model handled before. Avoiding it remains an open research problem.

Model updates target the agent level: one model per agentic system, which makes the model the least mutable tier in layered mutability. Per-user weight updates (for example, LoRA per user) remain a research direction; production deployments are rare.

Model updates are expensive, slow, and the hardest to reverse. Use them when the capability gap cannot be closed by better instructions or context.

Harness layer

The harness is the scaffold code that runs the agent, plus instructions and tools always present for every instance. Harness-layer learning rewrites the scaffold.

The Meta-Harness approach formalizes this: run the agent over a batch of tasks, store traces, then have a coding agent propose scaffold changes from those traces. LangChain applied this to Deep Agents, improving Terminal Bench 2.0 from 52.8% to 66.5% through harness changes alone.
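
The loop described above (run a batch, store traces, ask a coding agent for a scaffold change) can be sketched as follows. This is a minimal illustration under stated assumptions, not the Meta-Harness or LangChain implementation: the `Trace` fields, `run_agent`, and `propose_patch` are hypothetical names, and both the agent and the coding agent are stubbed as plain functions.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Trace:
    """One recorded task execution (hypothetical schema)."""
    task: str
    steps: List[str]
    passed: bool


def harness_improvement_round(
    tasks: List[str],
    run_agent: Callable[[str], Trace],
    propose_patch: Callable[[List[Trace]], str],
) -> Tuple[float, str]:
    """Run a batch, keep the traces, and ask a coding agent for a change.

    Returns the batch pass rate and the proposed patch text. Applying the
    patch is left to code review and deployment.
    """
    if not tasks:
        raise ValueError("need at least one task")
    traces = [run_agent(task) for task in tasks]      # 1. run over a batch
    pass_rate = sum(t.passed for t in traces) / len(traces)
    failures = [t for t in traces if not t.passed]    # 2. keep failing traces
    patch = propose_patch(failures)                   # 3. propose a change
    return pass_rate, patch


# Stub agent and stub coding agent, for illustration only.
def fake_agent(task: str) -> Trace:
    return Trace(task=task, steps=["ls", "cat README"], passed="easy" in task)


def fake_coding_agent(failures: List[Trace]) -> str:
    return f"# TODO: address {len(failures)} failing trace(s)"


rate, patch = harness_improvement_round(
    ["easy-1", "hard-1", "easy-2", "hard-2"], fake_agent, fake_coding_agent
)
```

Returning the patch as text rather than applying it keeps a human review step between the proposal and the deployment, which matters because a harness change reaches every instance.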

Harness updates affect every instance, so a fix generalizes across users and sessions. The tradeoff: changes require code review and deployment, and a bad harness change degrades everyone at once.

Context layer

Context sits outside the harness and configures it: skills, instructions, and memory specific to an agent instance, user, or organization. This layer is also called agent memory.

Context updates can be scoped at multiple levels:

Agent level — a persistent configuration the agent updates across sessions (for example, OpenClaw's SOUL.md, which the agent updates over time)
User or tenant level — per-user context that accumulates preferences and conventions (for example, Hex Context Studio, Decagon Duet)
Org level — shared context across a team or organization

These scopes coexist: an agent can update its own SOUL.md, accept user-level corrections, and pull from org-level rules.
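
One way to picture coexisting scopes is as layered configuration that gets resolved into a single view at load time. The sketch below is an assumption-heavy illustration: the rule names are invented, and the precedence order (user over agent over org) is a choice made for this example, since the source does not say which scope should win on conflict.

```python
from collections import ChainMap

# Hypothetical rule sets, one per scope.
org_rules = {"test_runner": "pytest", "license_header": "required"}
agent_rules = {"tone": "friendly", "commit_style": "conventional"}
user_rules = {"tone": "terse", "test_runner": "unittest"}

# ChainMap looks keys up left to right, so earlier maps take precedence.
# Assumed precedence for this sketch: user > agent > org.
effective = ChainMap(user_rules, agent_rules, org_rules)

# Flatten into a plain dict, as a harness might do before building a prompt.
resolved = dict(effective)
```

Keeping the scopes as separate mappings means a user-level correction can be reverted by deleting one key, without touching the org-level rules underneath it.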

Updates happen in two modes:

Offline (batch) — after execution, a background job analyzes traces and updates context. OpenClaw calls this "dreaming".
Hot path (inline) — the agent updates memory mid-task, either on user instruction or harness direction.
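
The two modes can be contrasted in a short sketch. Everything here is hypothetical: `MemoryStore` stands in for a context file, and the offline job assumes corrections appear in traces as lines of the form `CORRECTION key=value`, where a real system would more likely use a model to extract them.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class MemoryStore:
    """In-memory stand-in for a context file (hypothetical)."""
    notes: Dict[str, str] = field(default_factory=dict)

    def remember(self, key: str, value: str) -> None:
        """Hot path: write immediately, in the middle of a task."""
        self.notes[key] = value


def consolidate(store: MemoryStore, traces: List[List[str]]) -> int:
    """Offline: scan finished traces for corrections and store them.

    Returns the number of corrections written.
    """
    prefix = "CORRECTION "
    written = 0
    for trace in traces:
        for line in trace:
            if line.startswith(prefix):
                key, _, value = line[len(prefix):].partition("=")
                store.remember(key.strip(), value.strip())
                written += 1
    return written


store = MemoryStore()
store.remember("indent", "4 spaces")  # hot path, on user instruction

finished_traces = [
    ["ran tests", "CORRECTION test_runner=pytest"],
    ["CORRECTION indent=2 spaces", "done"],
]
updated = consolidate(store, finished_traces)  # offline batch job
```

Both paths end in the same write, so the difference is when the update happens and who triggers it, not where it lands.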

Context-layer updates (the agent memory tier) are the cheapest and easiest to reverse: edit a file, then reload the context. The tradeoff is limited scope: a context update does not improve base model capability, and it only affects instances that load it.

That low cost can mask silent failure modes. Practitioner reports from 2026 document stale memories surfacing after the underlying facts change, and recurring-correction loops in which a written-down rule loses out to competing retrievals. Retrieval quality, recency bias, and eviction policy decide whether an update actually takes effect.
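
As one possible guard against stale memories, retrieval can prefer the newest entry for a key and ignore entries past a maximum age. This is a sketch of that idea only, not a method described by the source: the `Memory` schema, the `recall` function, and the 30-day cutoff are all assumptions chosen for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Memory:
    key: str
    value: str
    written_at: float  # seconds since some fixed epoch


def recall(memories: List[Memory], key: str, now: float,
           max_age: float) -> Optional[str]:
    """Return the newest non-expired value for key, or None if none is fresh."""
    fresh = [m for m in memories
             if m.key == key and now - m.written_at <= max_age]
    if not fresh:
        return None
    return max(fresh, key=lambda m: m.written_at).value


DAY = 86_400.0
memories = [
    Memory("deploy_target", "staging-v1", written_at=0 * DAY),
    Memory("deploy_target", "staging-v2", written_at=20 * DAY),
    Memory("db_host", "old-db.internal", written_at=1 * DAY),
]

now = 40 * DAY
current_target = recall(memories, "deploy_target", now, max_age=30 * DAY)
stale_host = recall(memories, "db_host", now, max_age=30 * DAY)
```

Returning `None` for an expired entry forces the agent to re-ask or re-derive the fact instead of acting on an outdated one, at the cost of occasionally discarding a memory that was still valid.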

Choosing the right layer

graph TD
    A[Recurring failure observed] --> B{Is it a capability the model lacks fundamentally?}
    B -->|Yes| C[Model layer — fine-tune]
    B -->|No| D{Does it affect all users the same way?}
    D -->|Yes, shared scaffold issue| E[Harness layer — update scaffold]
    D -->|No, per-user or per-agent| F[Context layer — update instructions/memory]

The common anti-pattern is reaching for fine-tuning when a context update would suffice. A user-specific convention calls for a context update; it is not a model problem.
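
The decision flow above reduces to two questions asked in order, which can be written as a small function. The names `Layer` and `route_fix`, and the boolean parameters, are hypothetical; answering the two questions for a real failure is the hard part and is not modeled here.

```python
from enum import Enum


class Layer(Enum):
    MODEL = "model"
    HARNESS = "harness"
    CONTEXT = "context"


def route_fix(lacks_capability: bool, affects_all_users: bool) -> Layer:
    """Pick the update target for a recurring failure.

    Mirrors the flowchart: check for a fundamental capability gap first,
    then ask whether the failure hits every user the same way.
    """
    if lacks_capability:
        return Layer.MODEL
    if affects_all_users:
        return Layer.HARNESS
    return Layer.CONTEXT


# A per-user naming convention: no capability gap, not shared by all users.
convention_fix = route_fix(lacks_capability=False, affects_all_users=False)

# A tool that fails for every user of the agent.
broken_tool_fix = route_fix(lacks_capability=False, affects_all_users=True)
```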

Trade-offs at a glance

Dimension	Model	Harness	Context
Reversibility	Hard — requires retraining	Medium — requires deploy	Easy — edit a file
Generalization	Broadest — all instances, all tasks	All instances of this agent	Scoped to target level
Cost	Highest	Medium	Lowest
Latency to deploy	Days–weeks	Hours–days	Minutes
Risk of regression	Catastrophic forgetting	Breaks all instances	Scoped to loaded context

Traces as the common substrate

All three update flows consume execution traces. The mechanism differs per layer:

Model: collect traces, label outcomes, fine-tune
Harness: feed traces to a coding agent that proposes scaffold changes
Context: extract conventions and preferences from traces, write to memory files

Trace quality is a prerequisite for improvement at any layer.
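
To make the shared-substrate idea concrete, the sketch below feeds one list of traces into three different consumers, one per layer. The `Trace` schema and the three function names are assumptions. Keeping only successful traces as fine-tuning rows is one simple labeling choice among several, and no actual training or memory write happens here.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Trace:
    prompt: str
    response: str
    success: bool            # labeled outcome
    user_feedback: str = ""  # free-text correction, if any


def to_finetune_rows(traces: List[Trace]) -> List[Dict[str, str]]:
    """Model layer: keep successful traces as prompt/completion pairs."""
    return [{"prompt": t.prompt, "completion": t.response}
            for t in traces if t.success]


def to_failure_report(traces: List[Trace]) -> str:
    """Harness layer: summarize failures for a coding agent to read."""
    return "\n".join(f"FAILED: {t.prompt}" for t in traces if not t.success)


def to_memory_notes(traces: List[Trace]) -> List[str]:
    """Context layer: collect user corrections to write into memory files."""
    return [t.user_feedback for t in traces if t.user_feedback]


traces = [
    Trace("rename module", "done", success=True),
    Trace("run migrations", "timeout", success=False),
    Trace("write test", "done", success=True,
          user_feedback="use assert_raises in this repo"),
]

rows = to_finetune_rows(traces)
report = to_failure_report(traces)
notes = to_memory_notes(traces)
```

All three consumers depend on the same fields being recorded accurately, which is why a missing outcome label or a dropped correction limits improvement at every layer at once.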

Example

Claude Code maps cleanly to the three layers:

Model: claude-sonnet or similar — updated by Anthropic
Harness: the Claude Code application itself — updated when you upgrade the CLI
Context: CLAUDE.md, /skills, mcp.json — updated by you or the agent per project and session

A project-specific convention (for example, always use assert_raises instead of pytest.raises) belongs in context (CLAUDE.md or a skill file). A systematic reasoning failure belongs at the model layer and is Anthropic's problem to fix. A tool that is broken for every Claude Code user belongs in the harness.

Key Takeaways

Agents accumulate knowledge at three layers: model (weights), harness (scaffold), and context (external configuration). Each has a different cost, reversibility, and scope.
Most improvement opportunities target the context layer (agent memory) — it is cheapest, fastest, and easiest to reverse.
Model fine-tuning is rarely the right first response to a recurring agent failure; exhaust context and harness options first.
Traces are the input for improvements at all three layers; trace collection quality determines improvement velocity.

Agentic Flywheel: Self-Improving Agent Systems
Harness Engineering
Agent Memory Patterns
CoALA Memory Taxonomy Classifier: a companion taxonomy organized by what kind of memory is being classified, distinct from the update-target and persistence-scope axis used here
Memory Retrieval as a Control Decision
Scaffold Architecture Taxonomy for Coding Agents
Layered Mutability
Memory Synthesis: Extracting Lessons from Execution Logs
Continuous Agent Improvement