Skip to main content

Judgment Burnout

AI agents don't reduce work. They compress it. The compression lands on the one layer that doesn't scale: human judgment. The 22-year-old who thinks the answer is more agents, more code, less sleep is on track for a wall they can't see yet.


The new shape of work

A useful way to frame what is happening, from Teng Yan:

AI agents don't reduce work as much as they compress it. Less typing, more judgment. Less waiting, more context switching. Less execution, more supervision.

The pre-AI workday had natural pauses built in. Typing took time. Waiting on a build took time. Walking to a meeting took time. The slow parts let your brain breathe between decisions.

Compression strips out the slow parts. What remains is one decision after another, with no fallow space between them. A practitioner running three agents and reviewing their outputs is not doing three times the work. They are doing one stream of work where every minute is a judgment minute.

That is a different job than the one most knowledge workers trained for.

Why this burns differently

Typing fatigue and judgment fatigue are not the same currency.

Typing drains your hands and your eyes. You can recover from it inside a workday. Sit, stretch, walk, come back, keep going. Eight to ten productive hours is achievable on most days because the load is distributed across muscles and attention.

Judgment drains a single resource: the part of you that decides. More attention. More context switching. More verification. More decisions per hour. That resource has a hard daily ceiling, and once it is gone, the only repair is sleep.

The honest math for a practitioner running multiple agents at once is closer to four or five extremely intense hours, then numb until they reset. Not because they are weak. Because the work has been compressed onto a layer with a real limit.

The agent can run 24/7. The human still has a hard limit. Pretending otherwise is how you cook your brain.

The trap young ambitious operators are walking into

The pattern is predictable and worth naming directly:

  • Spin up more agents.
  • Ship more code.
  • Sleep less.
  • Outwork everyone.

For a few weeks it feels incredible. Multiple agents running. Tasks flying. Outputs reviewed, mistakes fixed, decisions made, the whole loop moving. It looks like a 10x version of the old job.

Then the wall arrives. The numbness sets in earlier each day. Decisions get sloppier. The thirteenth agent run of the morning gets approved without the same scrutiny as the third. Errors compound. Quality drops. The operator either keeps grinding (and ships slop) or burns out hard enough that they have to step away.

We are already seeing this in the AAS network. A lot of practitioners in their early twenties don't say it out loud, but you can tell. Friends are quietly burnt out. Stretches of brilliance bracketed by stretches of fog.

The reflex when this happens is to assume the answer is more leverage: another agent, another harness, a heavier setup. That reflex is wrong. The bottleneck moved. Adding leverage to a bottlenecked system makes it more expensive without making it faster. See you are the bottleneck.

Productivity theater

Token spending is the most visible form of productivity theater in the AI era. You can rack up enormous token bills, run a dozen agents in parallel, and feel the heat of activity without producing anything that compounds.

Productivity theater shows up in patterns like these:

  • Spinning up agents to feel busy, not because the work needs them.
  • Optimizing tooling instead of doing the work the tooling is supposed to enable.
  • Counting outputs instead of evaluating outcomes.
  • Running expensive reasoning models on tasks that don't need them.
  • Letting agents grind on the wrong problem because watching them grind feels like progress.

The cost is not just the bill. The cost is the operator's judgment, spent on supervising work that didn't need to happen. Every hour of judgment burned on theater is an hour of judgment not available for the strategic question that would have made the work unnecessary.

This is the token economy failure mode at the personal level: thinking the goal is more tokens through the system, when the goal is the right tokens applied to the right problem.

Strategy is the only real lever

If judgment is the bottleneck, the only real way to scale is to make each judgment count for more.

That is what strategy is the new execution actually means in the body. Strategy is not a deck or a quarterly plan. Strategy is the daily question of: which problem deserves a judgment minute today? Which agent run is worth the supervision cost? Which output gets a careful review and which one gets a glance?

A practitioner with sharp strategy can run two agents and ship more value than a practitioner running ten agents on the wrong problems. The judgment they spend lands on work that compounds. The judgment the other practitioner spends evaporates into supervision overhead for outputs nobody needed.

The 4-5 intense judgment hours are not a problem to solve. They are a constraint to design around. The design move is:

  • Pick fewer, bigger problems.
  • Spend the front of the day on strategy, before the judgment is depleted.
  • Use the judgment line discipline so deterministic work never touches your judgment layer in the first place.
  • Treat the wall as a real signal, not a personal failing. Stop when you hit it. Do something restorative. Come back.

This is also why 80% of the value of a Personal Agentic OS comes from using AI as a strategic thinking partner. The strategic layer is where each judgment minute multiplies. The execution layer is where each judgment minute decays.

What "high output" actually means now

Old definition: how much you produced today.

New definition: how much of what you produced was worth producing, judged a week later.

The compression makes that distinction matter. You can produce a lot of code, a lot of artifacts, a lot of agent runs and have very little of it survive contact with reality. The operators who will hold up over years are not the ones who pushed the most tokens. They are the ones who pointed their judgment at the right targets and protected the resource that made that pointing possible.

For young ambitious operators: the goal is not to outwork everyone. The goal is to outthink them, then let your agents handle the execution under your direction. The leverage is in the strategy. The wall is real. Plan around both.


Further Reading

  • Strategy Is the New Execution: The structural reason judgment is the new bottleneck. Where each judgment minute multiplies.
  • The Judgment Line: The design rule that keeps deterministic work out of your judgment layer. Code does plumbing, the LLM does routine judgment, you do the judgment that matters.
  • You Are the Bottleneck: The personal version. Adding agents and tools to a judgment-bottlenecked operator makes the bottleneck more expensive, not faster.
  • The Token Economy: The pricing layer that makes productivity theater feel cheap until it isn't. Tokens as a recruiting tool and a multiplier, but only when the strategy underneath is right.
  • Crutching: The sibling failure mode. Judgment burnout is overuse of your own judgment on the wrong work. Crutching is underuse of your judgment because you handed it to the model.
  • Slop Factory: What gets produced when burnt-out operators stop reviewing carefully. Volume without judgment is just expensive noise.
  • The Overconfidence Trap: Why the early honeymoon of running many agents feels like proof you can keep going. The signal lies right when you most need it.
  • LLM Psychosis: What burnt-out judgment actually ships. Generation without discrimination. The hard cap on parallel agents is set by your discriminatory bandwidth, not your subscription.