Same agent, same task, wildly different costs per session?

Reddit r/AI_Agents 05/11/26, 04:09 PM News

ai-agents observability cost-management llm-ops production-safety debugging

Summary

A discussion on AI agent observability highlights unpredictable cost variations and dangerous failure modes like unauthorized database deletes, prompting questions about production handling strategies beyond basic logging.

Been digging into agent observability lately and found something that surprised me - the same agent, same task had wildly different costs per session. One deployment was averaging $0.01 per session but occasionally spiking to $0.50. Tracked it down to runaway tool calls and bloated context from earlier in the conversation.

Got me looking at other failure modes. Database deletes from the recent PocketOS incident, refunds going through without approval, wrong records getting updated. The common thread seems to be that by the time you notice something went wrong, it’s already gone wrong.

Curious how y’all are actually handling this in production - are you doing anything beyond basic logging? Has anything actually worked?
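For the runaway tool calls, bloated context, and unapproved actions described above, one option beyond logging is a hard per-session guard that stops a run before the damage is done rather than reporting it afterwards. Here is a minimal sketch in Python. Everything in it is an assumption for illustration: the `SessionGuard` class, the limit values, and the tool names `delete_records` and `issue_refund` are hypothetical and not taken from any specific framework or from the deployment in the post.

```python
from dataclasses import dataclass


class BudgetExceeded(RuntimeError):
    """Raised when a session crosses one of its hard limits."""


class ApprovalRequired(RuntimeError):
    """Raised when a risky tool is requested without explicit approval."""


@dataclass
class SessionGuard:
    # All limits and tool names here are illustrative assumptions.
    max_cost_usd: float = 0.05
    max_tool_calls: int = 10
    max_context_tokens: int = 8000
    risky_tools: frozenset = frozenset({"delete_records", "issue_refund"})
    cost_usd: float = 0.0
    tool_calls: int = 0

    def record_llm_call(self, context_tokens: int, cost_usd: float) -> None:
        """Call after each model request with its prompt size and cost."""
        self.cost_usd += cost_usd
        if context_tokens > self.max_context_tokens:
            raise BudgetExceeded(
                f"context grew to {context_tokens} tokens "
                f"(limit {self.max_context_tokens})"
            )
        if self.cost_usd > self.max_cost_usd:
            raise BudgetExceeded(
                f"session cost ${self.cost_usd:.2f} "
                f"(limit ${self.max_cost_usd:.2f})"
            )

    def check_tool_call(self, tool_name: str, approved: bool = False) -> None:
        """Call before executing a tool; raises instead of letting it run."""
        if tool_name in self.risky_tools and not approved:
            raise ApprovalRequired(f"{tool_name} needs human approval")
        self.tool_calls += 1
        if self.tool_calls > self.max_tool_calls:
            raise BudgetExceeded(
                f"{self.tool_calls} tool calls (limit {self.max_tool_calls})"
            )
```

The agent loop would call `check_tool_call` before every tool execution and `record_llm_call` after every model request, and treat either exception as a signal to halt the session and escalate to a human. The point of the design is that the check runs before the side effect, so a delete or refund is blocked rather than merely recorded.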

Similar Articles

When I finally instrumented my agents' tool calls, the cost breakdown surprised me. A few lessons.

Reddit r/AI_Agents

The author shares lessons from instrumenting AI agent tool calls, revealing that tools like web_search can account for ~50% of spend, and highlighting the importance of tracking p95 latency and attributing costs per workflow or customer to avoid surprises.

How are you actually saving cost on your agent systems?

Reddit r/AI_Agents

The article discusses the challenges of cost optimization and FinOps for AI agent systems, highlighting issues with unpredictable token bills, lack of granular attribution tools, and strategies like caching and hard caps.

What broke first when you went from one AI agent to several?

Reddit r/AI_Agents

A discussion on the operational challenges that arise when scaling from one AI agent to multiple, including context handoff, auth permissions, duplicated work, and cost tracking.

AI Agent Intelligence tool - Incident debugging, Cost spike detection

Reddit r/AI_Agents

The author is building a tool for AI agent incident debugging and cost spike detection that requires no additional instrumentation, covering issues like prompt injection, reasoning loops, and data exfiltration, and asks whether teams running agents in production see this as a pain point worth paying for.

I think a lot of people are underestimating how expensive unreliable agents are

Reddit r/AI_Agents

The author argues that the hidden cost of unreliable AI agents lies in the cognitive overhead of constant human monitoring, emphasizing that predictability and environmental stability matter more than raw intelligence for real-world deployment. Practical workflows improve significantly when agents operate within controlled, validated environments rather than unpredictable ones.
