The hard part of agents is not building one. It is operating five.

Reddit r/AI_Agents 05/12/26, 02:21 PM News

agents operations observability deployment reliability tooling mcp

Summary

The article discusses the operational challenges of running multiple AI agents in production, emphasizing observability, recovery, and session management over the initial development of a single agent.

A pattern keeps showing up in agent threads here: the first agent is not the hard part. The hard part starts when you have several agents running repeatedly, with tools, state, approvals, retries, and partial failures. The questions become less glamorous: - Which agent ran this task? - Which tools or MCP servers were available? - What did it change? - Did it stop, fail, or wait for approval? - Which verifier/test phase passed it? - Can I replay or compare this run against the last good one? - What do I do when context runs out mid-task? I think a lot of agent reliability work is really agent operations work. Frameworks help build the agent, but teams still need an operating surface around runs, sessions, tools, approvals, and recovery. Curious how others here are handling this today. Are you using LangSmith-style traces, custom dashboards, Temporal/workflows, git worktrees, spreadsheets, or just logs and vibes?

Original Article

Similar Articles

The Real Truth About AI Agents

Reddit r/AI_Agents

An experienced practitioner shares hard-won lessons from deploying 25+ AI agents to production, arguing that memory, orchestration, and auditability matter far more than model choice. The article details common failure modes like context loss and silent cost loops, and recommends a stack including Claude Sonnet 4, Pydantic AI, and dedicated memory layers like Octopodas.

@_avichawla: https://x.com/_avichawla/status/2071897559287955680

X AI KOLs Timeline

The article discusses that the real challenge in AI agents is not building them but running them in production, and proposes the need for an operating system layer to manage fleets of agents, akin to how an OS manages software processes.

"At what point does adding another agent actually hurt your system? Asking because my 6-agent pipeline is slower and less reliable than my old 2-agent one

Reddit r/AI_Agents

A developer shares real-world experiences with AI orchestration frameworks (LangGraph, CrewAI, AutoGen), noting trade-offs between ease of prototyping and production reliability, and asks the community about handling failures, human-in-the-loop, and token costs.

the boring part of AI agents nobody builds and everyone needs

Reddit r/artificial

A practitioner recounts how deploying AI agents in production required 80% engineering effort on workflow, ownership, and approval processes rather than the model itself, highlighting that the 'boring layer' of shared context and routing is critical for real-world impact.

Has anyone deployed a multi-agent AI employee in production?

Reddit r/AI_Agents

A discussion about deploying multi-agent AI systems in production, where different agents handle planning, execution, communication, and project management, asking about real-world experiences and bottlenecks.

Similar Articles

The Real Truth About AI Agents

@_avichawla: https://x.com/_avichawla/status/2071897559287955680

"At what point does adding another agent actually hurt your system? Asking because my 6-agent pipeline is slower and less reliable than my old 2-agent one

the boring part of AI agents nobody builds and everyone needs

Has anyone deployed a multi-agent AI employee in production?

Submit Feedback