Same agent, same task, wildly different costs per session?

Reddit r/AI_Agents News

Summary

A discussion on AI agent observability highlights unpredictable cost variations and dangerous failure modes like unauthorized database deletes, prompting questions about production handling strategies beyond basic logging.

Been digging into agent observability lately and found something that surprised me - the same agent, same task had wildly different costs per session. One deployment was averaging $0.01 per session but occasionally spiking to $0.50. Tracked it down to runaway tool calls and bloated context from earlier in the conversation. Got me looking at other failure modes. Database deletes from the recent PocketOS incident, refunds going through without approval, wrong records getting updated. The common thread seems to be that by the time you notice something went wrong, it’s already gone wrong. Curious how y’all are actually handling this in production - are you doing anything beyond basic logging? Has anything actually worked?
Original Article

Similar Articles

How are you actually saving cost on your agent systems?

Reddit r/AI_Agents

The article discusses the challenges of cost optimization and FinOps for AI agent systems, highlighting issues with unpredictable token bills, lack of granular attribution tools, and strategies like caching and hard caps.

AI Agent Intelligence tool - Incident debugging, Cost spike detection

Reddit r/AI_Agents

Building a tool for AI Agent incident debugging and cost spike detection without additional instrumentation, covering issues like prompt injection, reasoning loops, and data exfiltration. Asking if customers in production environments see this as a pain point worth paying for.

I think a lot of people are underestimating how expensive unreliable agents are

Reddit r/AI_Agents

The author argues that the hidden cost of unreliable AI agents lies in the cognitive overhead of constant human monitoring, emphasizing that predictability and environmental stability matter more than raw intelligence for real-world deployment. Practical workflows improve significantly when agents operate within controlled, validated environments rather than unpredictable ones.