Our AI agent invoiced a customer for $0.00 and none of our logs caught it. Here is how we found it.

Reddit r/AI_Agents 05/29/26, 02:18 PM News

debugging ai-agent invoice hallucination tracing null-handling evaluation

Summary

An engineer describes how their AI sales agent confidently drafted a $0.00 invoice because it read a null discount field as 100% off, highlighting how hard agent workflows are to debug and why full execution tracing matters.

Quick war story because I want to know if anyone else has hit this. We run an internal sales-ops agent that handles a chunk of our quoting. Customer fills out a form, the agent pulls the relevant SKUs, runs them against our pricing logic, drafts an invoice, and sends it to a human before anything goes out. That human review step is the only reason we caught this.

Last Tuesday an AE pinged me with a screenshot. The agent had drafted an invoice for a 14-seat enterprise plan, line items correct, customer info correct, dates correct, total $0.00. Not blank, not null, not an error. The model had written "$0.00" with full confidence, formatted exactly like a real invoice line. If the AE had been moving fast and hit approve, that quote goes out.

My first guess was the pricing API returned a zero. It hadn't, the logs showed the correct number came back, the agent had just decided not to use it. Took me about a day to work out what actually happened, and it wasn't what I expected. I checked the API response, correct. Checked the prompt, unchanged from a version we'd run for three months. Ran the same input through staging and got the right invoice, couldn't reproduce it. Assumed a one-off model hiccup, moved on, then it happened twice more that day.

When I finally pulled the full trace of a failing run, there was a step in there I hadn't put there on purpose. After the pricing tool call, the agent had run its own "validation" against a contract object we'd dropped into the prompt context weeks earlier for an unrelated feature. That object had a discount_applied field that was always null for these customers, and the model read null as a 100% discount and confidently wrote $0.00.

None of my individual logs would have caught this. Printf debugging would've shown the pricing tool returning the right number and then the output mysteriously being zero.
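
To make the null problem concrete, here is a minimal sketch of one way to keep a bare null away from the model: turn the field into an explicit sentence before it goes into the prompt context. This is not the code from the system described in this post. The function name `describe_discount` is hypothetical, and it assumes discount_applied, when present, is a fraction between 0 and 1, which the post does not say.

```python
from typing import Optional


def describe_discount(discount_applied: Optional[float]) -> str:
    """Render a nullable discount field as an unambiguous prompt sentence.

    Assumption (not from the post): a non-null value is a fraction in
    [0, 1], for example 0.15 for 15% off.
    """
    if discount_applied is None:
        # Spell out what null means so the model never has to guess.
        return "No discount applies. Use the pricing tool total unchanged."
    if not 0.0 <= discount_applied <= 1.0:
        # Refuse to pass through values that make no sense as a discount.
        raise ValueError(f"discount_applied out of range: {discount_applied!r}")
    return f"A {discount_applied:.0%} discount applies to the pricing tool total."
```

The point of the sketch is only that the interpretation of null gets decided in code, where it can be reviewed and tested, rather than left to the model.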
The only reason I found the validation step was that it showed up as its own span in the trace, sitting between the tool call and the final synthesis.

The fix was dumb in retrospect. Pulled the contract object out of the invoicing path and added an eval that flags any invoice under a threshold for explicit review. Shipped in an afternoon once I knew where to look.

What I took from it: printf debugging is basically dead for agents, because the model can do things between your logged steps you'd never think to log. The scary failures aren't the garbage outputs, they're the plausible, well-formatted, completely wrong ones that pass every sanity check except "is this number actually right." And null in front of an LLM with no instructions on how to read it is asking for trouble.

We use Langfuse for the trace layer and honestly I don't know how anyone debugs production agents without something that records the full execution path. Curious if anyone else has stories like this, especially the "model confidently inserted a step you didn't ask for" failure mode, because that one rattled me more than a normal hallucination would have.
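
For anyone who wants something concrete, here is a rough sketch of what a threshold eval like the one described above could look like. It is an illustration under assumptions, not the actual check: `check_invoice_total`, `InvoiceCheck`, and the default threshold are all hypothetical names and values. The second check, comparing the drafted total against what the pricing tool returned, goes beyond what the post describes and is included as one possible answer to "is this number actually right."

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass(frozen=True)
class InvoiceCheck:
    needs_review: bool
    reason: str


def check_invoice_total(
    drafted_total: Decimal,
    tool_total: Decimal,
    min_total: Decimal = Decimal("1.00"),  # hypothetical threshold
) -> InvoiceCheck:
    """Flag a drafted invoice for explicit human review.

    drafted_total: the total the agent wrote on the invoice.
    tool_total: the total the pricing tool returned for the same quote.
    """
    if drafted_total < min_total:
        return InvoiceCheck(
            True, f"total {drafted_total} is below review threshold {min_total}"
        )
    if drafted_total != tool_total:
        # Extra check, not from the post: the draft should match the tool.
        return InvoiceCheck(
            True,
            f"total {drafted_total} does not match pricing tool total {tool_total}",
        )
    return InvoiceCheck(False, "ok")
```

Using Decimal rather than float keeps the equality comparison on money values exact. A check like this runs outside the model, so it still fires when the model inserts a step nobody asked for.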


Similar Articles

What's the most an AI agent has ever quietly cost you? Mine ran up about £220 overnight before I noticed.

an AI agent ran a real cafe's back office for 2 months, $38k out, $9k in. where should the human sign-off have been?

our agent said yes to something we do not sell, and the logs could not tell me why

Thoughts after I saw an AI agent ran up a $6,531 AWS bill in 24 hours

How my AI agent acquire customers for $0.20 only
