Glossary AI

Inference

The process of running a trained AI model to get a response. Every agent action runs inference, which costs money and time. Reducing inference calls — through caching, batching, or pre-computed shared state — is one of the highest-leverage optimizations in agent systems.

Related terms
Build with this on OTP

If this term shows up in your playbook, it belongs in your OOS.

OTP encodes coordination intelligence so teams of humans and AI agents can run on it.

Found an issue with this definition? Tell us and we'll fix it.