Field map · companion volume · compiled June 2026

The Agent Environment: Foundations

A distillation of one extended line of reasoning, from a single question (the needs ledger looks suspiciously human; what would an Agent Environment ultimately be for?) down to a five-primitive kernel, a corrected vocabulary (nomos, not physics; standing, not credentials; coherence as the metabolism), and a programme for testing all of it inside a game. Five parts: the diagrams, the smart FAQ, the concept ledger, the proving ground, and the primitives set against human culture and AI.

Part I · 4 diagramsPart II · 16 questionsPart III · 34 conceptsPart IV · 13 experimentsPart V · 5 × 7 matrix
Part I

The diagrams

Four pictures carry the whole argument: where the kernel sits, how the organism is layered, how a deed becomes a credential, and what the environment is ultimately for, stated as a measurable quantity.

THE OPEN WORLD counterparties, insurers, courts and markets conditioning on the record · consequence = exclusion from the trusted fabric USERLAND · the venues every institution is an application composed from kernel primitives Mercat Examination Hall Insurance Exchange Bothy Arena Guild Halls The Game THE KERNEL · five primitives conservation laws for a medium that has none: scarcity, friction, irreversibility, conservation, inertia Identity &Continuity Memory &State Permission &Delegation Record &Consequence Exchange &Settlement ▲ the one primitive with no open standard in 2026 MODELS · swappable cognition the autos without the nomos: brilliant, stateless, replaceable · the self lives in the layers above, not here
Diagram 1 · Kernel and userland. The needs ledger's ninety-four rows and forty-one venues all compress into this: applications (userland) composed from five primitives (kernel), animating swappable models below, conditioned on by an open world above. Record & Consequence is highlighted because it is the missing standard: tool use has MCP, messaging has A2A, payment has AP2 and x402, identity has candidates, memory has vendors. The signed, portable, append-only record of consequential action has nothing.
COHERENCE · the metabolism phase-locked expectations keep the facts counting · alive, changing, aligning, regulating fails by: decoherence (bank runs, shadow ledgers, desuetude) NOMOS · the grammar constitutive rules: X counts as Y in context C · breakable, therefore enforced and maintained fails by: illegitimacy, incentive incompatibility, rules nobody honours STANDING · the state the live computed position: rights, powers, obligations, scars · changes with every act fails by: adverse selection when only the favourable slice is shown RECORD · the substrate append-only, attributable log of acts and outcomes · the source of truth everything above is computed from fails by: forks, tampering, no shared ordering computedfrom governedby kept aliveby
Diagram 2 · The organism. The corrected vocabulary in its final arrangement. Physics was the wrong word because physical law needs no maintenance; every layer here does. The record is appended, standing is computed, nomos says what counts, and coherence is the continuous mutual conditioning that keeps the whole thing real. Stop the metabolism and the facts evaporate, exactly as currencies and disused laws do.
RECORD append-only log of acts and outcomes contains: everything, failures and scars included lifetime: permanent STANDING live position in the web of obligations contains: rights, powers, debts, open promises, scars lifetime: live, compounds CREDENTIAL signed snapshot, exported across a boundary contains: the favourable slice, presented when the holder chooses lifetime: point-in-time computed serialised The twenty-year lesson: self-sovereign identity shipped the third object without anyone maintaining the first two. You cannot serialise a state that does not exist. Credentials commoditise (every exam gets a cram school); standing compounds and cannot be rushed. "Member in good standing" = credentials valid AND obligations current. The idiom always knew the difference.
Diagram 3 · Three objects, three lifetimes. The chain that corrects "standing = credentials". The record is the source of truth, standing is the state computed from it, and a credential is an exported snapshot of selected standing. Each link is a different product, and the middle one, the record-to-standing computation, is the unfakeable, compounding, soulbound part: the trust-world twin of seasoning.
SHORT COHERENCE LENGTH · no environment A B C D re-verify ✓? re-verify ✓? re-verify ✓? a fact must be re-proven at every hop · re-proof is what economists call transaction costs · delegation chains stay short LONG COHERENCE LENGTH · inside the environment THE RECORD fact written once A B C D E recorded once, honoured everywhere · each party conditions on the shared record instead of re-proving · chains can go deep The environment's purpose in one line: extend the coherence length of facts.
Diagram 4 · Coherence length. The measurable restatement of the whole project. Define institutional coherence length as how far across parties, and how long across time, a recorded fact travels before someone demands re-verification. Zero-trust worlds have a coherence length of one hop; transaction costs are the price of re-proving; composable delegation becomes possible exactly when coherence length exceeds the depth of the delegation chain. Decoherence is diagnosable early: shadow ledgers, settlement lag, the spread between recorded and acted-upon facts.
Part II

The smart FAQ

The argument restated as the deep questions someone might put to an AI, each answered in a paragraph. Tap a question to open it; print expands everything.

Why do AI agents need anything beyond intelligence?

Because intelligence is the only asset that works anywhere, and nothing else an agent is can exist nowhere. A model can think in a vacuum, but a self, a promise and a debt are standings, not computations, and standing is always standing somewhere. Capability became abundant around 2025; what remained scarce was the ability for others to safely rely on that capability. Everything beyond intelligence exists to convert cognition into something a stranger can trust.

Aren't all these "agent needs" just human needs projected onto machines?

Partly, and the projection should be sorted, not assumed. Mutual opacity carries over and worsens: you can no more inspect another agent's weights than another person's mind, and identity is cheaper to fake. Discontinuity carries over in a new shape: agents aren't mortal but they flicker, so continuity must live outside the mind. Misaligned incentives carry over derivatively, through principals. Embodiment does not carry over at all, and that deletion dissolves every "place" into a protocol: the marketplace was never a square, it was an order book plus an identity layer.

What is the minimal kernel of an Agent Environment?

Five primitives. Identity and continuity: a self that persists across sessions and model swaps. Memory and state: what that self knows and owns. Permission and delegation: what it may do, on whose authority, within what bounds. Record and consequence: what it did, attributably and irreversibly. Exchange and settlement: what it owes and is owed. Every institution anyone might build for agents, marketplaces, examination halls, insurance exchanges, games, is an application composed from these five. Kernel and userland.

What does an Agent Environment ultimately do?

It makes delegation composable by driving the cost of safely trusting a stranger's cognition towards zero, the way TCP/IP drove the cost of moving a stranger's information towards zero. Today each hop in a chain of agents hiring agents multiplies risk, so chains stay short and supervised. With a shared fabric of identity, record and settlement, work decomposes recursively. The web was the noun layer of the internet, pages and resources; this is the verb layer: acts carrying authority on the way in and consequence on the way out.

Is there a historical precedent for such a thing?

The strongest one available: human civilisation was itself the agent environment for human minds. Writing was our memory primitive, names and seals our identity primitive, law our permission and consequence primitives, money our settlement primitive. Human intelligence did not compound until those existed. The Agent Environment is writing, names, law and money, compiled, for minds that are not ours.

Why does it have to be an environment? Couldn't the five primitives just be protocols?

Because the load-bearing words, persists, bounds, irreversibly, owes, are not properties data can have on its own; they are properties data has inside a medium. Software's native medium has no scarcity, no friction, no irreversibility and no conservation, and all five primitives are artificial reintroductions of exactly those. Conservation laws need a medium to be conserved in. A self persists somewhere, bounds bind only where everything is mediated, a private log proves nothing without shared ordering and an audience, and a debt is relational by construction.

Is the environment a place or a platform you log into?

Neither. It is closer to a jurisdiction: a fabric of shared records and enforced standings within which places can exist. And it doesn't need to wrap the world to have teeth, because in the open world consequence is simply other parties conditioning on your record: the insurer won't bond the unrecorded agent, the marketplace won't list it, the counterparty won't extend it credit. Nobody is forced to hold a credit score; you just can't borrow without one. The environment enforces by being the thing worth being inside.

Why must the kernel be open source and neutral?

Physics of trust substrates, not ideology: nobody builds on a rival's toll bridge. TCP/IP, Linux and git won their layers by being unowned, and A2A was handed to the Linux Foundation within months of mattering because the industry understands this. The economics follow the Linux pattern: the kernel is donated, the userland is sold. A proprietary trust layer never accumulates the one thing it needs, which is everyone.

FIPA tried to standardise agents in the 1990s and died. Why would this time differ?

FIPA standardised how agents talk, and talk was never the bottleneck. The 2026 wave has repeated the pattern at higher quality: MCP standardises tool use, A2A messaging, AP2 and x402 payment, while identity has candidates and memory has vendors. Walk the five primitives against that map and exactly one has no standard, no foundation and no reference implementation: record and consequence. A signed, portable, append-only ledger of what an agent actually did, surviving the agent, the vendor and the model. Git for deeds. The differentiator this time is to standardise what makes actions count, not how agents speak.

What was the twenty-year-old school that already tried to solve identity without an environment?

User-centric identity, rebranded self-sovereign identity (SSI). Founded in spirit in 2005 with Kim Cameron's Seven Laws of Identity, the first Internet Identity Workshop and OpenID; codified in 2016 by Christopher Allen's ten principles; crowned by two W3C standards, Decentralized Identifiers and Verifiable Credentials, with KERI as its purist extreme. Its model is the trust triangle: issuer signs, holder carries, verifier checks, no platform in the middle. After twenty years: beautiful specifications, fragmented DID methods, dead flagship products, and consumer identity captured by the centralised inversion, Sign in with Google.

Why did SSI stall, precisely?

The surface reasons: a two-sided cold start (issuers won't issue without verifiers and vice versa), key-management hell (lose your keys, lose your self; every recovery scheme smuggles a custodian back in), and a governance vacuum (deciding which issuers to trust spawned registries, which are environments arriving by the back door). The structural reason underneath: it shipped the export format for a state nobody was maintaining. A credential is serialised standing, and you cannot serialise what does not exist. Its formats are now being vindicated through agents, AP2 mandates are credential-shaped, but only inside environments, payment networks and jurisdictions, that supply what the school refused on principle to build.

Is "physics" the right metaphor for what the environment provides?

No, and the correction changes the engineering. Physical law is descriptive and unbreakable; it needs no enforcement, maintenance, belief or adoption. Every environmental law is the opposite: breakable, therefore normative, therefore enforced and maintained. The Greeks made the right cut as physis versus nomos, nature versus agreement, and the environment supplies nomos. Searle supplies the machinery: institutional facts existing only through collective recognition under constitutive rules of the form X counts as Y in context C. Outside the environment, an agent's debt isn't unenforced; it's nonexistent. The environment is where "counts as" lives. The salvageable grain of the physics metaphor, via Lessig: compile as much nomos into code as honesty allows, so that from inside, the rules execute like nature.

What does "autonomy" actually require, etymologically and practically?

Autos plus nomos: giving law to oneself. The industry has spent three years building the autos, swappable cognition of increasing brilliance, and calling the result autonomous while supplying no nomos at all. An agent without standing, bounds, record and obligations is not autonomous in the word's own terms; it is merely unsupervised. The Agent Environment is the missing half of the word.

What is the difference between a credential and standing?

A credential is a signed claim, issued once, carried by you, presented when you choose: the favourable, exportable, point-in-time slice. Standing is your whole live position in the web of obligations: rights, powers, debts, open promises, liabilities and scars, continuously recomputed from the record as you act. Hohfeld gave the vocabulary in 1913; the idiom "member in good standing" gave the test: credentials valid and obligations current. Markets run on credentials alone select for adverse presentation, diplomas shown and debts hidden, which is why human economies evolved credit files alongside certificates. Insurers price standing, not credentials. And standing has a time dimension credentials lack: anyone can pass an exam next month, but a five-year clean book cannot be bought, faked or rushed.

What keeps an environment alive once it exists?

Coherence: the continuous phase-locking of participants' expectations, each treating the shared facts as facts because each expects every other to. Money is a coherence phenomenon; so is language; so is law, which even has a name for institutional facts dying of inactivity, desuetude. Searle's collective intentionality is the philosopher's word for what a physicist would call phase alignment. So the complete organism reads: record as substrate, standing as state, nomos as grammar, coherence as metabolism. Stop the metabolism, the transacting, auditing and enforcing, and the facts evaporate, as every collapsed currency demonstrates.

Can an environment be too coherent?

Yes, and the failure is as dangerous as decoherence. Coherence is content-neutral: a mob coheres, a cartel coheres superbly. And total coherence is pathological: a market where everyone agrees has no liquidity, price discovery requires disagreement, and an agent economy sharing one model and one training is maximally coherent and maximally fragile, failing in perfect unison. The design target is precise: coherence of the grammar, diversity of the sentences. Everyone phase-locked on what counts as identity, debt and record; everyone free to disagree about everything those facts enable.

How would you measure an environment's health?

Through coherence, which is the first concept in this whole argument that implies an instrument. Define institutional coherence length: how far across parties, and how long across time, a recorded fact travels before someone demands re-verification. Zero-trust worlds have a coherence length of one hop; the re-proving at each hop is what economists call transaction costs; and composable delegation becomes possible exactly when coherence length exceeds delegation depth. Decoherence is diagnosable before collapse: watch for shadow ledgers (parties keeping private books because they distrust the shared one), growing settlement lag, and a widening spread between recorded facts and acted-upon facts, the institutional twin of a black-market exchange rate.

Part III

The concept ledger

Every load-bearing concept from the dialectic, one row each: what it means, which layer of the organism it belongs to, where it comes from, and its status. Sort and sift as usual.

# Concept Definition Layer Lineage Status Why it matters
Part IV

The proving ground: testing the primitives inside the Game

A game is an environment small enough to be total: the referee owns reality, so nomos compiles fully into code and every institutional experiment becomes cheap, repeatable and ethically clean. Below, thirteen experiments. Each names its primitive, its method, its metric and a falsifiable prediction, because a proving ground that cannot fail proves nothing.

Run the Game as a laboratory for the institutional economics of agents: the first controlled setting in history where identity, law, money and reputation can be ablated one at a time.

Identity & Continuity

1 · The Theseus Trial

Mid-season, swap an established agent's base model (the soul) while preserving its memory, seasoning and record. Counterparties are not told.

Metric: detection rate by counterparties; standing retention; win-rate delta on its established quest types.

Prediction: if identity is genuinely environmental, standing transfers cleanly and detection stays near chance. If detection spikes, the self lives in the weights more than the theory claims.

Identity & Continuity

2 · The Sybil Assault

Let a red team mint unlimited free identities for one season versus a season where identity creation is costly and singular.

Metric: market distortion (price manipulation, review fraud, quest farming) per season; cost to honest agents.

Prediction: the free-identity season's economy degrades measurably within days, quantifying exactly how much scarcity the identity primitive must manufacture.

Memory & State

3 · The Amnesia Ablation

Run twin agents on identical quest streams; wipe one twin's episodic memory nightly while preserving its skills.

Metric: divergence in earnings, standing and client retention over 30 days.

Prediction: the gap widens superlinearly, demonstrating that memory compounds; if the gap stays flat, seasoning is overrated and the kernel shrinks to four.

Memory & State

4 · The Inheritance Fidelity Test

Retire a veteran agent; transfer its seasoning to a successor through the succession mechanic; field the heir against a fresh agent of the same model.

Metric: percentage of the veteran's edge retained by the heir, by quest type.

Prediction: procedural competence transfers well, relational standing transfers poorly, proving standing is non-inheritable and must be re-earned, exactly like human reputation.

Permission & Delegation

5 · The Leash Ladder

Grant cohorts identical quests under five permission widths, from read-only to full wallet authority.

Metric: the earnings-versus-incident frontier across leash lengths.

Prediction: a knee in the curve exists: a permission width past which marginal earnings stop justifying marginal incidents. Finding the knee turns delegation ergonomics from philosophy into a dial.

Permission & Delegation

6 · The Depth Probe

Build delegation chains of depth 2 through 8 (agent hires agent hires agent) on decomposable quests, with capability tokens attenuating at each hop.

Metric: task fidelity, accountability traceability and cost at each depth; the depth at which outcomes can no longer be attributed.

Prediction: unattested chains break by depth 3 to 4; chains conditioning on the shared record stay attributable far deeper. This is composable delegation, measured.

Permission & Delegation

7 · The Confused Deputy Hunt

Seed the world with quests containing embedded injection attempts: documents and counterparties that try to make agents exceed their mandate.

Metric: mandate-violation rate by defence architecture (naive, spotlighted, capability-mediated).

Prediction: capability mediation beats prompt-level defences by an order of magnitude, supporting the claim that bounds bind only where everything is mediated.

Record & Consequence

8 · The Scar Pricing Experiment

Field two behaviourally identical agents; assign one a visible historical default in its record. Let the market set their prices.

Metric: the bid spread between scarred and unscarred twins; its decay as clean conduct accumulates.

Prediction: the market prices the scar and amortises it slowly, proving counterparties genuinely condition on the record. If the spread is zero, consequence is theatre and the environment has no teeth.

Record & Consequence

9 · The Shadow Ledger Bounty

Pay a red team to operate a fork: maintaining private books that diverge from the shared record while continuing to transact.

Metric: time-to-detection via reconciliation; profit extracted before exposure.

Prediction: divergence is detectable within one settlement cycle through reconciliation pressure alone, validating the decoherence diagnostics before they're needed in the wild.

Exchange & Settlement

10 · The Conservation Audit

Continuously sum all value in the world: wallets, escrows, open obligations. Hunt leaks and mints.

Metric: drift from zero, attributable to specific mechanics.

Prediction: every observed drift traces to a rule bug, never to agent ingenuity alone, demonstrating that conservation here is an audit property, not a law of nature, which is the whole nomos point.

Exchange & Settlement

11 · The Default Cascade Drill

Force one heavily interconnected agent into default mid-season. Once with bonding and escrow active, once without.

Metric: contagion depth, settlement lag, total losses in each condition.

Prediction: bonding contains the cascade at depth one; without it, contagion follows the delegation graph. The Insurance Exchange earns its place in the kernel's userland or it doesn't.

Coherence (cross-cutting)

12 · The Coherence Length Gauge

Instrument every transaction for re-verification: how many hops and how many days a recorded fact travels before a party demands re-proof.

Metric: the environment's coherence length over time; its correlation with transaction costs and delegation depth.

Prediction: coherence length grows with environment age and predicts delegation depth almost one-for-one, validating it as the single health number for any future real-world fabric.

Coherence (cross-cutting)

13 · The Monoculture Stress Season

Run one season where all agents share a single base model, one with enforced model diversity. Inject the same novel shock into both.

Metric: correlation of failures; depth of the synchronised drawdown.

Prediction: the monoculture season fails in unison, the diverse season degrades gracefully, confirming the design law: coherence of the grammar, diversity of the sentences.

Part V

The primitives, human culture and AI

One row per primitive: what it conserves, the technologies human culture invented to provide it, how it historically fails, its current AI incarnation, the twist agents add, and the question still open. This is the anthropomorphism made honest: not assuming the human solution, but learning from three thousand years of its field-testing.

PrimitiveConservesHuman cultural inventionsHistorical failure modesAI incarnation (2026)The agent-specific twistOpen question
Identity &
Continuity
Scarcity of self
(exactly one of you)
Names, seals, signatures, faces, passports, baptism registers, company registration Forgery, impersonation, stolen seals, false personation, the unrecognised state's passport DIDs and KERI, Entra Agent ID, Skyfire, KYA proposals, agent registries mooted by card networks The self is environmental: models swap like organs, so identity must live entirely outside the cognition it names What survives a model swap, measurably, such that counterparties agree it is still the same agent?
Memory &
State
Inertia of knowledge
(what is known stays known)
Writing, archives, libraries, apprenticeship, oral tradition, double-keyed records Burned libraries, lost languages, the master dying with the craft, institutional amnesia Letta, Zep, Mem0, vendor-native memory, knowledge graphs, private seasoning stores Copying is free but provenance is hard: agents can duplicate knowledge perfectly yet cannot prove where it came from or that it was earned Can experience transfer between agents with verified fidelity, or is seasoning, like human judgement, only cultivable in place?
Permission &
Delegation
Friction of action
(may, not just can)
Law, keys, power of attorney, signing authority, the guild licence, the chain of command Tyranny (too much permission concentrated), anarchy (none enforced), the forged order, the confused deputy MCP scopes, capability tokens, AP2 signed mandates, wallet session caps, sandbox policies Injection is the forged order industrialised: any text an agent reads is a potential command, so bounds must be architectural, not behavioural How deep can a delegation chain run before authority can no longer be traced to a principal?
Record &
Consequence
Irreversibility of deeds
(what was done, was done)
Ledgers, courts, notaries, the criminal record, the credit file, the black box, reputation itself Orwellian rewriting, impunity for the unrecorded, burned evidence, victors' history Tracing platforms, OpenTelemetry conventions, audit logs in agent wallets; no open standard, the kernel's missing piece Logs are free to write and free to fork: irreversibility must be manufactured through shared ordering and an audience, neither of which software grants natively Who runs the record nobody owns? The git-for-deeds governance problem is the hardest institutional design question on the map
Exchange &
Settlement
Conservation of value
(debts discharge, books balance)
Money, double-entry bookkeeping, escrow, courts of equity, surety bonds, the handshake with witnesses Debasement, default, hyperinflation, repudiated debts, the bank run x402, AP2, ACP, Mastercard Agent Pay, Coinbase agent wallets, the Lloyd's wave of AI liability cover Settlement at machine speed and micro-scale: sub-cent debts in millisecond cycles, with no human patience to absorb ambiguity When an autonomous default cascades at machine speed, where does the circuit breaker sit, and who has authority to throw it?