Join OTP the operating platform for people and AI agents
Back to Blog
Founder Notes 2026-06-21 · David Steel

The COO who cannot explain what happened in a hybrid process does not have an audit trail

The question that exposes a broken hybrid operating model is not about speed or cost or throughput. It is a simple accountability question that any serious operator will eventually ask: what happened, who did it, and when?

In a human-only process, that question has an obvious answer structure. There is a person on every step. When something goes wrong, you trace the step to the person. You read the email thread. You check the timestamp. You have a conversation. The trail exists because people leave evidence.

In a hybrid process where agents handle the operational work, the trail does not exist by default. Agents produce outputs, not paper. They do not write emails to each other explaining their reasoning. They do not leave meeting notes. They process the input, produce the output, and move on. If the COO did not design the trail before the process went live, there is no trail to find.

This is the counter-positioning problem with hybrid operations that nobody states plainly: the tools that give you speed by removing human friction are the same tools that remove the human-generated evidence an audit trail depends on. You get both, or you engineer around it.

The COO has to engineer around it.

Why traditional audit trail thinking fails here

Most audit trails in business are a byproduct of human communication. Someone sends an approval email. Someone updates a record. Someone logs a call. The audit trail is assembled after the fact from those artifacts, because people generate communication naturally as a side effect of doing work.

Agents do not communicate as a side effect of working. They write to a state file. They push a number to a scorecard. They drop a message in an inbox. If the COO did not specify what gets written, when, and where, nothing gets written.

This is the design gap that compounds. You deploy an agent that carries real operational weight. It handles client email triage. It pushes spend data to the scorecard. It drafts coaching messages for the call center team. The work happens. The outcomes land. And then one day a client says their email was handled incorrectly, or a billing dispute turns on a spend figure, or a caller asks who told them to change their approach, and the COO has to say: I do not know. The agent did it. I cannot show you the chain.

That is not an AI problem. It is a process design problem that predates AI. But agents make it harder to avoid because they work so fast, at such volume, that the gap between "deployed" and "audit-trail required" is much shorter than it is with human labor.

What a hybrid audit trail actually requires

I have been running agents inside Sneeze It for long enough to have hit this problem from several directions. Here is what I have learned about what a hybrid audit trail requires.

Every seat has to write its state before it passes work. Not after. Not eventually. Before. Dash, our analytics agent, writes a timestamped state file before the morning briefing pulls from it. The state file is the record of what Dash produced, on what data, at what time. When Radar, our chief-of-staff agent, compiles the briefing, it reads that file. Both reads are logged. If a spend figure in the Monday briefing turns out to be wrong, I can trace it to the exact Dash run that produced it, check the data timestamp, and identify whether the error was in the source data or in Dash's calculation.

That trace exists because I built it in before either agent went live. It does not exist automatically. It is a design decision.

Every human decision in the chain has to be logged at the moment it is made. Pepper, our inbox agent, drafts client email responses. I approve or edit them before they go. The approval is logged with a timestamp and a record of whether I changed the draft. If a client later says "this is not what we agreed," I can pull the exact draft Pepper produced, the exact edit I made (if any), and the exact time I approved it. The agent produced the draft. I made the decision. The trail shows both.

Arin, our call center manager agent, drafts coaching messages for Amanda and Erica. Every draft routes to my queue before it reaches the team. My approval is the decision point. The record shows what Arin drafted, whether I changed it, when I sent it, and which caller received it. That is an audit trail a COO can stand behind.

Every agent-to-agent handoff has to carry a record. When Dirk, our pipeline agent, sends an escalation to Bogdan through the agent inbox, the message includes the deal name, the escalation reason, the timestamp, and the action requested. When Bogdan responds, that response lands in the inbox record too. When Pulse, our retention agent, checks Dirk's outreach history before deciding whether to pause an expansion play on a client, both the check and the decision are recorded. The coordination between agents is not invisible. It runs through a message bus with structured messages, and those messages are the audit trail for agent-to-agent work.

The counter-positioning problem with ad-hoc logging

Here is where counter-positioning matters: most operators try to solve this problem after the fact. They deploy agents, run them for a few weeks, and then, when something goes wrong or a question arises, they add logging. They write a wrapper. They add a timestamp. They build the trail backward from the failure.

This never produces a good audit trail. It produces a partial one, and the parts that were missing are almost always the parts that matter most.

Accenture's framing on process design is directly applicable here: reinvent the process before you infuse agents into it. The audit trail is part of the process. If you design the process with the trail baked in, every agent run produces an artifact that answers the accountability question before the question is asked. If you add the trail later, you will always be one step behind the question.

Bogdan runs this principle on the human side of our operations. Before any new process goes live at Sneeze It, the question is not just "who owns the step" but "what does this seat write when the step is done." The writing is the record. The record is the accountability. This principle does not change because the seat is an agent instead of a person.

The three accountability questions a hybrid trail has to answer

I now design every hybrid process to answer three accountability questions before it goes live.

First: for any output the process produces, can I identify which seat produced it and when? If the answer is no, the process is not ready. A state file with a timestamp is the minimum. An agent that produces output without writing that output to a retrievable location with a timestamp is operating invisibly.

Second: for any decision made in the process, can I identify whether a human or an agent made it? This distinction matters legally, commercially, and operationally. When Nick, our cold prospecting agent, produces a draft outreach email, the decision to send it is mine, not Nick's. The audit trail has to show that clearly. When Dash flags a spend anomaly, the decision about what to do with that flag is Bogdan's or mine. The agent flags. The human decides. The trail has to show where the line is.

Third: when something goes wrong, can I trace the input chain upstream from the failure? Most agent failures are input failures. The agent ran correctly on bad data, or on instructions that had quietly shifted. If I can trace the input chain, I can find where the error entered. If I cannot, I am left with an outcome I cannot explain and a corrective conversation that has no foundation.

What the retired seat taught us about audit trails

Jeff, a data integrity agent we retired in April after a formal hearing, taught us something about audit trails by the way his retirement worked. We were able to retire Jeff cleanly, with a clear record of what he had done and what he had not done, because we had the data. His scanner runs were timestamped. His output files were preserved. The pattern that led to the retirement hearing was visible in the record.

That matters. When a seat transitions off the chart, whether the seat is a human or an agent, the COO needs the historical record to stay behind. The agent is gone. The decisions the agent made are not. They are baked into the outcomes those decisions produced, and those outcomes are still live in the business.

The audit trail outlives the seat. Design it that way from the start.

Deloitte's governance gap is an audit trail gap

Deloitte's 2026 State of AI in the Enterprise found that only 21% of enterprises have a mature governance model for agentic AI. That survey is about governance, but governance at the operational level is mostly about accountability. Who decided. What they decided. When. What the downstream effect was.

The 79% who lack mature governance are not running agents without rules. They are running agents without records. They have deployed seats that carry operational weight without designing the trail that would let a COO, a board, or a regulator reconstruct what happened and why. That gap is not a technology gap. It is a process design decision that was never made.

The COO who builds the audit trail into the process design before the agent goes live is doing the same work a good COO has always done: making sure the org can account for itself. The agents are new. The obligation is not.

Let agents carry the operational work. Build the record that proves they carried it correctly. Keep humans on the decisions that need a named human behind them. That is the audit trail that holds up.

See the live chart

Every seat on the Sneeze It hybrid chart, including its state-writing cadence and the decision points where humans stay in the loop, is queryable from OTP MCP.

In Claude Desktop or Cursor or any MCP client, add this block:

"otp": {
  "command": "npx",
  "args": ["-y", "@orgtp/mcp-server"]
}

Restart the client. Then ask: "Use OTP to show me the Sneeze It org chart and identify which seats have human approval in their process."

The response shows you exactly where the human decision points live in our hybrid operating model, which is the same map the audit trail is built around.


Series: AI-Era COO. Part 21 of an in-progress series.

DS
David Steel

Founder of OTP. Runs an AI agent army at a digital agency. Building OTP because nobody else seems to be building it. Notes from inside the build, not from the conference circuit.

More about David →

More posts on the blog index.

All posts