A voice call is not “done” if the cost is still invisible

Reddit r/AI_Agents 06/05/26, 01:46 PM News

voice-agent cost-tracking calls logging api-monitoring ai-costs

Summary

A developer argues that voice call logs must include cost and token data, not just duration and status, to properly assess voice-agent economics, sharing a lesson from a stress test where cost fields were initially null.

For weeks my completed calls had the two fields I needed most set to null. `totalTokensUsed: null` `costCents: null` The calls were finishing. The transcripts existed. The status said completed. But I still could not answer the question that matters the moment you scale past demos: What did that call actually cost to complete? The first time the fields finally populated, it was on a long stress test: 764 seconds, 140,034 tokens. That number changed how I thought about the whole result row. Duration and final status were not enough. If the call can run long, retry, hit voicemail, get interrupted, or require human follow-up, cost has to sit next to the outcome. My rule now: before you trust voice-agent pricing, run ten ugly calls, not ten clean demos. For each completed call, I want: - duration - timeout/retry count - token/cost - outcome - owner - whether a human follow-up is still needed If cost lives in a separate dashboard, you do not have a completed-call receipt. You have a transcript and a bill you will discover later. What are people here logging per call before they call the economics “known”?

Original Article

Similar Articles

A phone call is not done when the audio ends

Reddit r/AI_Agents

The article argues that phone calls handled by AI agents are not complete when audio ends; the real test is whether promises (e.g., callbacks) are properly captured in the work queue with owners, deadlines, and evidence, rather than just having a nice transcript.

Are coding agents getting expensive, or are we measuring cost the wrong way?

Reddit r/AI_Agents

The article questions whether the real cost of coding agents includes hidden human oversight and debugging, arguing that true value should be measured by trusted output rather than raw token consumption.

When I finally instrumented my agents' tool calls, the cost breakdown surprised me. A few lessons.

Reddit r/AI_Agents

The author shares lessons from instrumenting AI agent tool calls, revealing that tools like web_search can account for ~50% of spend, and highlighting the importance of tracking p95 latency and attributing costs per workflow or customer to avoid surprises.

Same agent, same task, wildly different costs per session?

Reddit r/AI_Agents

A discussion on AI agent observability highlights unpredictable cost variations and dangerous failure modes like unauthorized database deletes, prompting questions about production handling strategies beyond basic logging.

6 months running a production voice agent for service businesses. The latency math is way harder than the demos suggest.

Reddit r/ArtificialInteligence

After 6 months running a voice AI agent for service businesses, the author reveals that real-world latency is bimodal (median ~800ms, p95 ~2.4s) and this p95 determines user perception. Issues like VAD misfires, function call degradation with long prompts, and TTS quality matter more than LLM choice, with multilingual support adding significant costs.

Similar Articles

A phone call is not done when the audio ends

Are coding agents getting expensive, or are we measuring cost the wrong way?

When I finally instrumented my agents' tool calls, the cost breakdown surprised me. A few lessons.

Same agent, same task, wildly different costs per session?

6 months running a production voice agent for service businesses. The latency math is way harder than the demos suggest.

Submit Feedback