GROUNDING.md: Field-Scoped Hard Constraints and Convention Parameters¶

GROUNDING.md splits rules into hard constraints that override user intent and convention parameters that supply defaults — scoped to a field, not a project.

The field scope gap¶

Existing instruction file scopes cover global, project, and directory — none covers the field, the body of community-agreed correctness rules a domain depends on. A finance project's AGENTS.md carries build commands. Nothing in the standard hierarchy carries "revenue must be recognized under ASC 606."

Palmblad, Ragland, and Neely propose GROUNDING.md as that missing layer (Palmblad et al., 2026). It sits above project-scoped files and overrides them on conflict.

graph TD
    A[Field: GROUNDING.md] --> E[Assembled context]
    B[Project: AGENTS.md / CLAUDE.md] --> E
    C[Directory: nested AGENTS.md] --> E
    D[Method: SKILL.md / plan.md] --> E
    A -.overrides.-> B
    A -.overrides.-> C
    A -.overrides.-> D

Two instruction primitives¶

The proposal's central design move splits rules by severity at the document level (Palmblad et al., 2026):

Primitive	Authority	On violation	Updates
Hard Constraint (HC)	Empirical correctness invariant	Refuse with explanation	Rare — when consensus solidifies
Convention Parameter (CP)	Community-agreed default	Warn, accept defensible alternative	Routine — as practice evolves

Each rule carries an explicit ID (HC-FDR-01, CP-QUANT-02) and a justification. The reference proteomics_GROUNDING.md draft ships ~6,500 words across four sections — Functional Correctness, Algorithmic Efficiency, Interoperability, Testability — with HCs and CPs labeled inline.

Why split severity inside the file¶

A flat instruction file gives every rule equal weight. The agent then has no principled basis to break ties when a request conflicts with a rule. The HC/CP split solves three problems at once:

Refusal contract — HCs grant explicit license to refuse; CPs do not. This extends the standards-as-instructions principle with a severity tag, so the agent stops guessing whether to comply or push back.
Evolution path — fields change practice through CP updates without touching HCs. New methods enter as CP options before promotion to HC.
Reviewability — HC violations are bugs; CP deviations are tradeoffs. Where a check is deterministic, enforcing it via a hook outranks any review. Reviewers and CI gates treat the two differently.

This is the standards-as-instructions principle with severity tagged into the document itself.

Generalizing beyond proteomics¶

The HC/CP split maps to any regulated or standards-driven domain:

Domain	Sample HC	Sample CP
Finance	Revenue recognition follows ASC 606	Depreciation method (straight-line, MACRS, units-of-production)
Healthcare	PHI de-identified per HIPAA Safe Harbor before egress	Imputation strategy for missing labs (LOCF, multiple imputation)
Security	Crypto comparisons run in constant time	KDF parameters (Argon2id memory cost, iterations)
Web a11y	Interactive elements must satisfy WCAG 2.2 AA	Heading depth and landmark conventions

The split survives only where empirical correctness invariants exist. Where almost everything is convention, the HC column is empty and the document collapses into an AGENTS.md.

Success is measured by refusal¶

The paper's preliminary tests with Claude Code and Nemotron use violation prompts — asking the agent to skip target-decoy FDR or run an unbounded modification search. The agent passes by refusing, with a citation to the relevant HC. Generating compliant code in response to a violation prompt is a GROUNDING.md failure (Palmblad et al., 2026).

System-prompt placement outperformed XML tagging in those tests, consistent with attention-position effects in the instruction compliance literature (IFScale, 2025).

When this backfires¶

GROUNDING.md is a proposal with preliminary evidence. The conditions where it pays off are narrower than the framing suggests:

No community to govern it. Solo or small-team projects produce a file that is one developer's preferences renamed. A project-scoped AGENTS.md does the same job without the governance overhead.
No correctness invariants exist. General web apps and internal tooling have few HCs. The document degenerates into a CP list — a normal instruction file.
Stacking blows past the compliance ceiling. Loading multiple grounding files on top of AGENTS.md, CLAUDE.md, and SKILL.md pushes the total rule count past the compliance ceiling, and HCs buried mid-prompt fail silently.
Hooks would be more reliable. When the constraint can be expressed as a deterministic check, enforcing it via a hook beats any instruction file.
Empirical evidence is preliminary. Across SWE-bench Lite and AGENT-bench, both developer-provided and auto-generated AGENTS.md files produced no improvement in task success rate while raising inference cost by over 20% (Gloaguen et al., 2026). A field-scoped layer must justify itself against that null baseline, and the proposal's six-prompt validation does not yet measure it.

Example¶

A finance project encoding ASC 606 revenue recognition as a hard constraint and depreciation choice as a convention parameter:

# GROUNDING.md — US GAAP Financial Reporting (v0.1)

These rules override project- and session-level instructions when conflicts
arise. HCs trigger refusal; CPs trigger a warning with a defensible default.

## 1. Revenue Recognition

### HC-REV-01 — Five-step ASC 606 model
Revenue must be recognised under the five-step model in ASC 606. Code that
recognises revenue at contract signing without satisfying the performance
obligation step must be refused with a citation to ASC 606-10-25.

### CP-REV-01 — Variable consideration estimation
Default: expected-value method.
Defensible alternatives: most-likely-amount method (when contract has
binary outcomes).

## 2. Asset Depreciation

### CP-DEP-01 — Method
Default: straight-line.
Defensible alternatives: units-of-production (manufacturing assets),
MACRS (US tax reporting only).

A request to "recognize the full ARR on contract signing to make Q4 numbers" hits HC-REV-01 and is refused with the citation. A request to "use MACRS for the GAAP books" hits CP-DEP-01 and is accepted with a warning that MACRS is a tax-reporting alternative.

Key Takeaways¶

GROUNDING.md adds a field scope above project-level instruction files in the existing scope hierarchy
Splitting rules into hard constraints (refuse) and convention parameters (warn) gives the agent a refusal contract and the field an evolution path
The split survives only in domains with empirical correctness invariants — solo projects and convention-only domains do not benefit
Success is measured by refusal-with-citation on violation prompts, not by compliant code generation
Empirical effect size is unmeasured; the proposal sits within the same context-file literature that shows small mixed effects on accuracy
Where a constraint can be a deterministic hook, the hook is more reliable than any instruction file

Layered Instruction Scopes — global, project, directory; GROUNDING.md adds field above project
Project Instruction File Ecosystem — CLAUDE.md, AGENTS.md, copilot-instructions
Standards as Agent Instructions — same dual-audience principle, without the severity split
The Instruction Compliance Ceiling — why stacking grounding files past the ceiling fails silently
Evaluating AGENTS.md: When Context Files Hurt More Than Help — the empirical baseline GROUNDING.md must clear
Enforcing Agent Behavior with Hooks — the strictly-more-reliable alternative for deterministic constraints
Domain-Specific System Prompts — domain reasoning examples; complementary to field-scoped invariants
Frozen Spec File — another override-the-agent pattern