Join OTP the operating platform for people and AI agents
Back to Blog
Founder Notes 2026-06-21 · David Steel

Your compensation model still pays for hours. Agents do not work hours.

The compensation model most companies run today was designed for a workforce that does the work.

The assumption underneath it is straightforward: people get paid to do things, and the things they do produce outcomes. More hours, more output. Better skills, harder tasks. Bigger role, higher pay. The whole architecture, from job grades to bonus targets to performance ratings, rests on that assumption.

The assumption is now wrong for a meaningful slice of the workforce, and it is going to get more wrong fast.

When agents absorb the operational work, humans do not stop working. They shift. They stop doing the task and start owning the outcome. That is a real and significant change. But the compensation model does not shift with them. It still rewards the task. It still pays for the doing.

That gap is the failure this post is about.


The diagnostic

Before I name the failure modes, I want to be precise about what I mean by "agents absorb the work." I do not mean agents replace people. At Sneeze It, we run twelve active agent seats alongside eight human seats, and our headcount is not shrinking. What I mean is that the operational layer, the work that used to fill a human's day, is now done by agents. Radar runs the morning briefing and calendar management. Tally pushes KPI values to the scorecard four times a day. Dash monitors ad performance across thirty-nine accounts and flags anomalies before any human sees them. Dirk scans the sales pipeline and drafts outreach. Arin coaches the call center team with personalized daily feedback. Nick runs cold prospecting. Pepper triages the inbox and drafts client responses.

These are not automations in the old sense. They are seats on the org chart with names, owners, metrics, and accountability structures. Bogdan, our COO, owns the oversight of several of these seats. Janine, our head of accounting, is a named owner in the billing loops. Crystal, our project management agent, runs under human oversight on delivery risk.

The humans are still essential. But the work they are essential for has changed. And the pay model has not caught up.


Failure mode one: rewarding output that agents produce

The most common failure I see is a compensation model that bonuses humans for outputs agents are now generating.

If Dash produces the ad performance report and a human's bonus is tied to "delivering weekly performance reports," that human is now being compensated for oversight work while the pay structure treats it as production work. Those are different things. Oversight work is harder to measure, requires more judgment, and has a different failure mode when it goes wrong. But it often pays less, because the old pay model valued the hours of production, not the quality of the judgment applied to it.

This is not just a fairness problem. It is a calibration problem. When pay stays attached to outputs agents generate, you lose the signal that pay is supposed to provide. The humans who are excellent at oversight, who catch agent errors before they cost money, who know when to override, who maintain the human accountability the agents cannot carry, those humans are doing the most valuable work in the org. But the comp model often does not distinguish them from the humans who just sign off on agent outputs without reading them.


Failure mode two: measuring activity that no longer requires humans

Related to the first failure, but distinct: many performance review systems track activity as a proxy for contribution. Calls made. Emails sent. Reports produced. Tickets closed.

When agents handle those activities, the proxy breaks. Arin sends the coaching messages. Nick drafts the cold emails. Pepper handles the inbox triage. Dirk tracks the pipeline. If you are still evaluating humans on calls-made or emails-sent, you are measuring the wrong layer.

The activity was always a proxy for contribution. The real question was always "did this person contribute to the outcomes we care about?" When agents absorb the activity, the question does not go away. It gets sharper. You now have to measure contribution directly, because the proxy is gone.

Most orgs are not ready for that. Their performance review systems are built on activity proxies because activity is easy to count. Contribution is harder. You need to know what the outcome was, and you need to be able to attribute it to the human who was responsible for it, even when an agent did the operational work.


Failure mode three: grading humans on tasks that are now split

Job grades, pay bands, and promotion criteria are usually defined by task complexity. A grade-5 analyst does X, Y, Z. A grade-7 analyst does A, B, C, and manages two grade-5s.

When agents absorb X, Y, and Z, the grade-5 seat description becomes incoherent. The tasks are still being done, just not by the human. The human is now doing something closer to what the grade-7 used to do: overseeing outputs, applying judgment, catching errors, managing the accountability. But the title says grade-5 and the pay says grade-5 and the promotion criteria still require demonstrating X, Y, and Z.

SHRM's 2026 State of AI in HR research found that AI is 5.7 times more likely to shift job responsibilities than to displace jobs outright. That is the honest picture. The work shifts. The grade structure does not shift with it, and the person doing higher-order work gets stuck in a pay band designed for lower-order tasks.


Failure mode four: rewarding agents through the wrong channel

This one is more subtle, but it matters for accountability.

Some orgs respond to the agent transition by trying to build agent-specific reward structures, giving teams that deploy agents a bonus for "AI productivity gains" or creating recognition programs for "best AI implementation." The impulse makes sense. The mechanism is wrong.

The problem is that it puts the reward on the technology adoption rather than the business outcome. It is the agentic equivalent of paying someone to use a new software system. What you actually want to reward is the outcome the agent contributed to, attributed to the human who owns that seat.

HBR and BCG research published in May 2026 made this point in a way I think is worth quoting directly. When organizations anthropomorphize agents, treating them as something like employees rather than tools with human owners, the result is reduced individual accountability, increased unnecessary escalation, and lower review quality. The conclusion is not that agents should not be on the org chart. It is that the human accountability must stay with the human, clearly and unambiguously.

MIT SMR is equally direct: agentic AI cannot be accountable for its decisions. The deploying human is. That is not a legal technicality. It is the design of a system that works. Agents carry the operational load. Humans carry the accountability. The pay structure has to reflect that distinction.


What to do instead

The fix is not to tear up the compensation model. It is to update the three things the model uses to make decisions: what it measures, what it rewards, and who it holds accountable.

Measure outcomes, not activities. If an agent is producing the activities, stop counting the activities as proof of human contribution. Decide what outcome the seat is responsible for, measure the outcome, and compensate against it. At Sneeze It, Bogdan is not measured on whether the briefing got produced. He is measured on whether the decisions informed by that briefing moved the business forward. Tally's output is the scorecard number. The human who owns that seat is measured on the accuracy and timeliness of the number, and on what gets done when the number is off.

Rewrite the grade criteria for the work that remains. If agents have absorbed the tasks that defined a job level, redefine the job level for the work that remains. That work is almost always harder: judgment, oversight, escalation decisions, the choice of when to override an agent recommendation. It should not pay less. It should pay more, because the failure mode when it goes wrong is worse and harder to detect.

Name the human owner on every agent seat and make accountability explicit in the performance review. This is not an HR formality. It is how you keep accountability attached to a human when an agent is doing the operational work. At Sneeze It, Jeff, a former agent on our data integrity seat, was retired in April by a human decision made in a formal hearing. The accountability for that seat, and for the decision to retire it, was always held by a human. The agent never held it. That distinction has to be explicit in the performance review system, because if it is not explicit, it drifts. The human starts to feel like a monitor, not an owner. The quality of oversight drops. The accountability becomes fictional.


The thing that does not change

Korn Ferry's 2025 workforce research found that 48% of employees globally fear their jobs will be replaced by AI within three years. That fear is real, and the compensation model is one of the places where it either gets resolved or gets amplified.

If the comp model continues to pay for activities that agents now produce, it sends a signal that the human is redundant: the agent does the work and the human collects the reward. That is the version of the story where the fear of replacement becomes self-fulfilling.

If the comp model shifts to pay for the outcomes humans are now uniquely positioned to own, including oversight, judgment, accountability, and the decision about which work stays human, it sends a different signal. The signal is that the human's value increased when the agent took the toil, because what is left requires a human.

That second version is truer, in my experience. But it does not happen automatically. Someone has to redesign the pay model to reflect it.

Bersin put the economics directly: for each dollar spent on machine learning technology, companies may need to spend nine dollars on intangible human capital. That nine-to-one ratio is not a cost to minimize. It is the investment that makes the one dollar of ML deliver. The compensation model is where that investment becomes visible in the organization.

Let agents carry the operational work. Pay the humans for what they do with it.


See the live chart

You can query which seats at Sneeze It are agent-owned versus human-owned, see who holds accountability for each agent seat, and pull the current outcome metrics tied to each seat directly from OTP's MCP server.

In Claude Desktop or Cursor or any MCP client, add this block:

"otp": {
  "command": "npx",
  "args": ["-y", "@orgtp/mcp-server"]
}

Restart the client. Then ask: "Use OTP to show me the Sneeze It org chart. For each agent seat, tell me the named human owner and what outcome metric that human is accountable for."

The answer tells you what an accountability-first hybrid comp model looks like in practice, not in theory.


Series: AI-Era CHRO. Part 29.

DS
David Steel

Founder of OTP. Runs an AI agent army at a digital agency. Building OTP because nobody else seems to be building it. Notes from inside the build, not from the conference circuit.

More about David →

More posts on the blog index.

All posts