I've killed more agents than I've kept. Sharing the patterns in what dies and why.

Reddit r/AI_Agents News

Summary

The author shares five patterns that consistently kill AI agents: too many jobs per agent, no human-in-the-loop for destructive actions, unstructured outputs, no spend caps, and lack of uncertainty escalation paths. Practical guardrails and a checklist for reliable agent deployment are provided.

Last 6 months I've shipped roughly 30 different agent attempts. About 8 are still running. The other 22 died in week 1 or week 2 for the same handful of reasons every time. Sharing the 5 patterns I keep watching kill agents, in case it saves someone the same lessons. Too many jobs in one agent Every "do everything inbound" agent I built died fast. The minute you ask one agent to triage email AND draft replies AND schedule meetings AND log tickets, edge cases multiply geometrically. Each individual job is fine alone. The combination explodes. The pattern that works instead: one agent, one job. Five small agents that each do one thing reliably beats one mega-agent trying to do five things. Boring but it's the truth. No human in the loop on destructive actions Anything that sends, posts, charges, or deletes — if you let the agent execute without approval, you will eventually pay for a mistake that costs more than the agent saves you. I learned this the embarrassing way when an inbox agent emailed a half-baked draft to an actual customer. Now: draft, queue it, ping me in Slack with approve/reject buttons, I tap one. Latency is fine. Public mistakes are not. Unstructured LLM output When you let the model return plain text and parse it downstream with regex or string matching, every fifth run breaks because the model phrased it slightly differently. Force structured JSON. Validate the schema before consuming. If parsing fails, retry once then fail loud (Slack alert, don't silently move on). Sounds basic. Almost every dead agent I look back at skipped this. No spend cap Twice in 6 months I had agent loops eat $200+ in API calls overnight because of a bug that turned into a polling loop. Now every agent has a hard monthly spend cap. When it hits, the agent pauses and Slacks me. The 90 seconds it takes to wire up saves hundreds. If you're on Anthropic or OpenAI, both have built-in spend limits per key. Use them. The agent doesn't know when it's wrong Agents that confidently hallucinate are worse than agents that say "I'm not sure, escalate." The ones that survived had explicit uncertainty paths baked into the prompt: "If you don't have enough information to answer with confidence, output {escalate: true, reason: '...'}." The ones that died kept being confidently wrong for days until I noticed. The meta-pattern across all five: agents don't fail loudly. They don't crash. They slowly produce bad output until you realize something's off downstream. Loud failures are easy. Silent ones kill you. My current sanity check before I trust an agent unattended: - Does it have exactly one job? - Is there an approval queue for anything destructive? - Does it output structured data with validation? - Is there a hard spend cap with alerting? - Is there an explicit "I don't know, escalate" path? If all 5 are yes, it usually survives past month 1. If any are missing, I can usually predict the week it dies. The other thing I've stopped doing: trying to build agents for tasks that happen less than weekly. The maintenance overhead almost always exceeds the saved time for low-frequency stuff. Agents work best on patterns that repeat often. Curious what's killed your agents. Are you seeing the same patterns, or different ones?
Original Article

Similar Articles

The Real Truth About AI Agents

Reddit r/AI_Agents

An experienced practitioner shares hard-won lessons from deploying 25+ AI agents to production, arguing that memory, orchestration, and auditability matter far more than model choice. The article details common failure modes like context loss and silent cost loops, and recommends a stack including Claude Sonnet 4, Pydantic AI, and dedicated memory layers like Octopodas.