XO Docs
The Future of WorkPhase 1: Agentic WorkforceResearch: Why the Unit of Work

How environments help businesses

Persistent environments make intent cheaper to infer: the same unit of work costs ~42% less by run 50.

A persistent environment makes the same work measurably cheaper over time, because it converts uncertainty the agent would otherwise re-purchase on every unit into state it inherits for free.

The environment argument and the calculation model made the trust case for the environment: it is where verification and settlement become real. This page makes the economic case.

Intent gets cheaper with data

Every unit of work has two costs hiding inside its token bill: the cost of doing the work, and the cost of figuring out what the work is: reading the codebase, asking clarifying questions, rediscovering conventions, inferring what this user means by "done." The first cost is irreducible. The second is a function of how much the agent already knows:

C(uₜ) = c_exec + k · H(intent | Mₜ)

c_exec   tokens to execute the work itself (the floor)
H(·|Mₜ)  the agent's residual uncertainty about intent,
          conditioned on the environment's memory Mₜ after t units
k        tokens burned per unit of uncertainty resolved

Without an environment, nothing carries over: M is empty on every run, H stays at its cold-start value, and the agent pays the full intent cost on unit 1 and on unit 100. With a persistent environment, every completed unit deposits into Mₜ: the files touched, the definitions of done that were accepted, the corrections a human made, the conventions that survived review. Residual uncertainty falls monotonically with accumulated data, so the cost per similar unit decays toward the execution floor:

Mₜ ⊇ Mₜ₋₁   ⟹   H(intent | Mₜ) ≤ H(intent | Mₜ₋₁)   ⟹   C(uₜ) ↓ c_exec

This is the information-theoretic restatement of something every manager knows: a contractor's tenth job for you is cheaper than their first, because they stop asking questions they already know the answers to. The environment is what gives an agent tenure.

Tokens per similar unit of work
without an environmentwith a persistent environment
unit 1 (cold start: both pay full intent cost)unit 50 (environment pays only the execution floor)

Figure 5: Token cost per similar unit. Without an environment the agent pays the full intent-resolution cost every time (dashed); with one, uncertainty resolved once becomes state, and cost decays toward the execution floor (solid).

Same use case, with and without the environment

We extended the micro-benchmark to 50 sequential, similar units (a support queue), holding the work constant and varying only whether the agent runs inside a persistent environment. Cold-start intent cost is 2,600 tokens on top of a 3,000-token execution floor; with an environment, intent cost decays as H₀ · t⁻⁰·⁶ and the intervention rate falls as definitions of done sharpen (20% → 5%), since failed checks also get rarer as the environment learns what done means:

Unit 1Unit 10Unit 25Unit 50Total (50 units)
Without environment5,600 tok5,600 tok5,600 tok5,600 tok308,000 tok
With environment5,600 tok3,653 tok3,377 tok3,250 tok181,260 tok
Savings0%35%40%42%41%

Three readings. First, the environment pays for itself in volume: unit 1 costs the same either way, so the benefit is invisible in a demo and decisive in production. Second, steady-state cost approaches the execution floor: at unit 50 the agent spends 42% less for the same outcome, and that gap is durable because it comes from eliminated uncertainty, not from a better model. Third, the savings compound with the verification economics of the calculation page: fewer failed checks mean less rework, so the intervention rate and the token bill fall together.

The mechanism behind the curve is worth stating plainly. An agent without an environment faces fresh entropy on every unit: each run, it must re-resolve the same questions about this team's data, conventions, and intent. An agent inside a persistent environment faces constant data (the same files, the same memory, the same accumulating record), so the only entropy left in unit t is what is genuinely new about unit t. Stable data is what lets the agent learn at all; learning is what makes it cheap. For a business, this inverts the usual procurement logic: the asset is not the agent you rent, it is the environment you own, because the environment is where the cost curve lives.

Next: implications.

On this page