Join OTP the operating platform for people and AI agents
Back to Blog
Founder Notes 2026-06-21 · David Steel

Some work should stay human. Here is how to tell which work that is.

The most seductive mistake in building an agent org is trying to figure out everything agents can do.

That question has no bottom. Agents can do more than you think, and the ceiling is rising every quarter. If you organize your team around what agents are currently capable of, you will be reorganizing again in six months. The capability boundary is not a stable place to draw a line.

The right question is different. It is not "what can agents do." It is "what should stay human, and why."

That question has a more durable answer. And the answer matters for the quality of every decision your hybrid org makes from this point forward.

The counter-position that changed how I think about this

There is a productive split in the current literature on agent management that I want to name directly, because it has shaped how we run things at Sneeze It.

One camp, represented by MIT SMR research from late 2025 and HBR's emerging "agent manager" framing, argues that agentic AI should be managed more like a human coworker than like a traditional tool. Sixty-nine percent of the experts MIT SMR surveyed agreed that agentic AI demands new management approaches. The logic is that agents are complex, non-deterministic, and consequential enough that traditional IT governance is insufficient.

The other camp pushes back hard. HBR published research in May 2026, with BCG, that found a different problem: when organizations treat AI agents like employees, outcomes degrade. Specifically, individual accountability fell, unnecessary escalation increased, and review quality dropped. Their recommendation was to frame agents as "rented contractors with a narrow statement of work," governed by scoped permissions, kill switches, audit logs, and named human owners, not HR onboarding or performance titles.

Both camps are right about the same underlying thing. Every agent needs a named human owner. Every agent needs a measured seat with observable outputs. Accountability must remain with a person, not with the agent. MIT SMR is explicit on this: agentic AI cannot be accountable for its decisions; the deploying human is.

The disagreement is about framing, not about substance. And the framing matters, because the wrong frame generates real errors.

At Sneeze It, we do not onboard agents like employees. We do not give them performance reviews in the HR sense. What we do is give every agent a seat with a named owner, a clear metric, and a human decision at every inflection point. When Jeff, our former data integrity agent, was no longer earning his seat, I made the retirement decision. A human made it. Jeff did not resign. There was a hearing. The accountability was mine.

That is accountability architecture, not anthropomorphizing.

What agents carry well

Once you are clear on where accountability lives, the question of what agents should carry becomes more tractable.

Agents carry operational work well. Repetitive, high-frequency, rules-based, measurable work. The kind of work that benefits from consistency, speed, and 24-hour availability. The kind of work where variation is a defect, not a feature.

Radar, our chief-of-staff agent, runs daily briefings. He scans Slack channels, compiles calendar data, reads the sales pipeline, flags stale items, and writes a structured summary to our shared daily note every morning before I am at my desk. That work is operational. The value is in it being done reliably, at the same cadence, in the same format, every single day.

Dash reads every ad account we manage, roughly $136,000 in monthly spend across 39-plus accounts, and surfaces patterns against yesterday's baseline and the seven-day average. That work is high-frequency and high-stakes, but the value is in consistency of coverage, not in judgment calls about what the patterns mean.

Tally pushes our KPI values from local source files to the scorecard four times a day on weekdays. Arin analyzes call center data and drafts coaching messages for review. Nick runs cold prospecting at a volume no human would sustain. Crystal tracks project delivery across our Accelo instance.

Every one of those seats carries operational throughput. They carry it so that the humans on the team are not carrying it. That is the point.

The Bersin framing I find most useful here is the $9 principle: for each dollar spent on machine learning technology, companies may need to spend nine dollars on intangible human capital. The agent is the dollar. The human capital it frees up is the nine. If your agents are not freeing up human capacity for something more valuable, you are paying for the dollar and throwing away the nine.

What should stay human

The work that should stay human is the work where variation is not a defect. Where judgment, care, and context cannot be reduced to a prompt.

I run this as four categories.

Accountability decisions. The moment when someone has to own a result and answer for it, to a client, to a partner, to a team member, to a governing body, that moment stays human. Dirk, our sales agent, finds pipeline signals and drafts outreach. I approve every message before it goes. The accountability for what gets said to a prospect under our name is mine, not Dirk's. Agents can draft. Humans sign.

Relationship calls where trust is on the line. When a client is frustrated, when a partnership is fragile, when a hiring decision carries real risk to someone's livelihood, the human stays in the room. I have Arin draft coaching messages for our call center team, but I read every one before it posts. The relationship between me and those callers is mine. The agent assists it. The agent does not hold it.

Culture, conflict, and ethical judgment. SHRM's 2026 data shows AI is 5.7 times more likely to shift job responsibilities than to eliminate them, and three times more likely to create new roles than to displace jobs. What gets created are roles that require humans to be present in ways agents cannot be. When there is a conflict between two team members, when a decision carries values implications, when the right answer requires context that lives outside any data source, the human is the answer. Bogdan, our COO, makes calls that require organizational judgment. Janine makes calls that require accounting ethics. No agent on our team is being positioned to replace those calls.

The decision about which work stays human. This is the recursive one. The decision about where to draw the human-agent line is itself a human decision. It requires judgment about risk, about values, about what the organization stands for, and about what quality of work is actually needed in a given seat. Deloitte's 2025 research found that 73 percent of leaders say middle-manager role reinvention is critical but only 7 percent report great progress. The bottleneck is not knowing what to reinvent toward. It is knowing what to protect.

The line is not about capability

Here is the practical test I run when I am evaluating whether a piece of work should move to an agent seat or stay human.

First: what is the quality signal? If quality is measurable and consistent quality is the goal, agents handle it. If quality requires judgment about a specific situation, humans handle it.

Second: who answers when it goes wrong? If the answer is a machine, the seat has no real owner and you will discover that at the worst moment. If the answer is a person with a name, the seat can be accountable.

Third: does variation here create value? If the answer is yes, the work benefits from a human. Agents produce consistent output. Humans produce contextual output. When contextual is what you need, agents are the wrong tool regardless of capability.

Fourth: is this work where the relationship is the product? Client trust, team culture, partnership health. These are not outputs of work. They are the work. Agents support these. They do not hold them.

Only 6 percent of leaders in HBR's December 2025 survey fully trust agents with core processes. That is not a confidence problem to be solved with better technology. It is a structure problem. When the accountability is explicit, when the human owner is named, when the observability is real, trust becomes appropriate rather than excessive. The question is not whether to trust agents. It is what to trust them with.

What this looks like in practice

At Sneeze It, our hybrid org chart has roughly twelve agent seats and a smaller number of human seats. The agents carry the operational throughput. The humans carry the decisions, the relationships, and the accountability.

Radar briefs me. I decide what to act on. Dirk drafts outreach. I approve it. Dash surfaces patterns. I decide what they mean for a client relationship. Arin coaches the call center team with my review at every step. Nick finds prospects. I decide who to pursue.

The agents are not in the room for the hard calls. They prepare the room. They fill it with data, drafts, signals, and patterns. Then the humans make the call.

This is the shift Korn Ferry's research points at when it says only 42 percent of CHROs are prioritizing AI investment and only 5 percent feel fully prepared. The preparation gap is not technical. It is structural. It is the gap between deploying agents and knowing what to protect when you do.

Let agents carry the operational work, so people are free for the work that matters.

That sentence sounds simple. Executing it requires being clear about which work matters, and clear-eyed about why it has to stay human.

See the live chart

Every seat on our org chart is queryable via OTP MCP, including which seats are agent-owned versus human-owned, who the named human owner is for each agent seat, and what metric each seat is accountable for.

In Claude Desktop or Cursor or any MCP client, add this block:

"otp": {
  "command": "npx",
  "args": ["-y", "@orgtp/mcp-server"]
}

Restart the client. Then ask: "Use OTP to show me the sneeze-it org chart and identify which seats are held by agents versus humans, and who owns accountability for each agent seat."

The response will show you what the line between human and agent work looks like when it is made explicit in a live organization.

DS
David Steel

Founder of OTP. Runs an AI agent army at a digital agency. Building OTP because nobody else seems to be building it. Notes from inside the build, not from the conference circuit.

More about David →

More posts on the blog index.

All posts