Some notes and lessons on Agents, RAG and memory
Summary
The author shares notes and lessons learned from building AI agents at scale, focusing on RAG and memory management to help others.
Similar Articles
Memory for agents ain't here yet
A critique of current memory solutions for AI agents, arguing that RAG wrappers and similar approaches fail to address core issues of model bias and context bloat.
Agent memory is not just RAG over user facts
The article argues that simple RAG-based agent memory systems fail in production due to issues like stale preferences, missed keywords, and prompt injection, and advocates for a layered memory architecture with active selection, deterministic fallback, governance, and testing.
The Real Truth About AI Agents
An experienced practitioner shares hard-won lessons from deploying 25+ AI agents to production, arguing that memory, orchestration, and auditability matter far more than model choice. The article details common failure modes like context loss and silent cost loops, and recommends a stack including Claude Sonnet 4, Pydantic AI, and dedicated memory layers like Octopodas.
How AI agent memory works (28 minute read)
The article provides a comprehensive technical overview of how AI agent memory works, distinguishing between working and long-term memory mechanisms, and discussing strategies for context management, embedding-based retrieval, and data lifecycle governance.
What I learned trying to make agent memory survive more than one session
The article reflects on the complexities of AI agent memory beyond simple storage, highlighting challenges such as determining truthfulness, priority changes, distinguishing decisions from noise, and appropriate timing for surfacing context.