A community discussion on agent memory reveals that while various patches exist for what to write down (e.g., plain files, layered memory, post-mortems), the unsolved problem is what to keep—detecting failures is tractable, but deciding which lessons persist still needs human judgment.
Posted a question here about why agents remember what sounds related instead of what actually worked. Got a bunch of genuinely thoughtful replies, so here's what I took away, because the pattern across them turned out to be more interesting than my original question. Almost nobody defended pure vector search. The recurring frustration was the same one a few people put really well: embeddings are good at "what looks similar," but what an agent actually needs is "what worked" and "what failed," and those are different retrieval problems. Someone called the missing piece "negative memory," the agent never preserves "we tried this route and it died because X," so it cheerfully walks back into the same wall. What surprised me was how many people had already built their own patch for it, and how different the patches were: Someone runs working memory entirely out of plain files, the agent decides what to write, old stuff rolls off to a vector DB, no fancy platform. Their line stuck with me: "if you have to ask, the system is broken." Someone else splits memory into layers, stable trusted facts vs citable receipts, and won't let the agent act on anything it can't point to a source for. "I found a likely match" vs "I know." A third had the agent write its own plain english post mortem after every task ("tried X, failed because Y, next time Z") and semantic searches those before new work. Their honest downside: after 30 or 40 of them the file gets noisy, so they added a summarization pass. Another proposed three ledgers (facts, run state, decisions and failures) and routing by intent before semantic search, so "did we try this already" hits the failure log first. Here's the thing that jumped out once I lined them all up. Every one of these solves what to write down. None of them really solves what to keep. The most precise version of this came from someone running it in production. Their take: detecting that a failure happened is mostly doable automatically, you can catch tool errors, failing tests, timeouts, reverted patches, even tasks that end without verification (treating missing verification as its own failure signal, which I thought was sharp, otherwise you only ever log the loud crashes). But they explicitly would not trust automation to infer the actual lesson. What the failure meant, and whether it should stick, still needs a human or a forced explicit write. That split is the whole thing, I think. Detection is tractable. Consolidation is not. How does the system decide what is actually true vs just frequently retrieved, and what survives once it is written? Everyone is using a heuristic for that ("prove it twice," recency, manual rules, a summarization pass), and every heuristic breaks at a predictable edge. Building in this space, which is why I asked. For people running agents in production: where does your system stop trusting automation and start needing a human in the loop? Is that line moving for anyone, or is it stuck where it is?
A discussion on the practical challenges of managing agent memory in AI systems, focusing on avoiding information overload that degrades output quality, and proposing strategies like using workflow state and multi-agent architecture.
A survey of memory implementations across major AI agent harnesses (Claude Code, Codex, Copilot, etc.) reveals common boundary failures including bounded local storage, keyword retrieval, harness scoping, weak staleness handling, and 57-71% cross-user contamination rates, highlighting unsolved problems in agent memory infrastructure.
A reflection on agent memory as primarily an infrastructure/data-management problem rather than an AI problem, focusing on practical complexities like permissions, scopes, and revision history.
A developer discusses limitations in current AI agent memory systems and proposes a new memory layer tool with episode storage and replay debugging, seeking community validation.
A developer reconsiders agent memory as more than storage, proposing a living graph with roles and activation fields to give past information appropriate authority and context.