SimpleMem: Efficient Lifelong Memory for LLM Agents
Summary
Introduces SimpleMem, an efficient memory framework for LLM agents that uses semantic lossless compression to improve accuracy and reduce token consumption, achieving 26.4% F1 improvement and up to 30x reduction in inference-time token usage.
View Cached Full Text
Cached at: 05/24/26, 12:26 AM
Paper page - SimpleMem: Efficient Lifelong Memory for LLM Agents
Source: https://huggingface.co/papers/2601.02553
Abstract
Tosupportreliablelong-terminteractionincomplexenvironments,LLMagentsrequirememorysystemsthatefficientlymanagehistoricalexperiences.Existingapproacheseitherretainfullinteractionhistoriesviapassivecontextextension,leadingtosubstantialredundancy,orrelyoniterativereasoningtofilternoise,incurringhightokencosts.Toaddressthischallenge,weintroduceSimpleMem,anefficientmemoryframeworkbasedonsemanticlosslesscompression.Weproposeathree-stagepipelinedesignedtomaximizeinformationdensityandtokenutilization:(1)SemanticStructuredCompression,whichappliesentropy-awarefilteringtodistillunstructuredinteractionsintocompact,multi-viewindexedmemoryunits;(2)RecursiveMemoryConsolidation,anasynchronousprocessthatintegratesrelatedunitsintohigher-levelabstractrepresentationstoreduceredundancy;and(3)AdaptiveQuery-AwareRetrieval,whichdynamicallyadjustsretrievalscopebasedonquerycomplexitytoconstructprecisecontextefficiently.Experimentsonbenchmarkdatasetsshowthatourmethodconsistentlyoutperformsbaselineapproachesinaccuracy,retrievalefficiency,andinferencecost,achievinganaverageF1improvementof26.4%whilereducinginference-timetokenconsumptionbyupto30-fold,demonstratingasuperiorbalancebetweenperformanceandefficiency.Codeisavailableathttps://github.com/aiming-lab/SimpleMem.
View arXiv pageView PDFProject pageGitHub3.39kAdd to collection
Get this paper in your agent:
hf papers read 2601\.02553
Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash
Models citing this paper0
No model linking this paper
Cite arxiv.org/abs/2601.02553 in a model README.md to link it from this page.
Datasets citing this paper2
#### molmohsen/awesome-ai-agent-papers #### zhongweixie/A-Survey-on-AI-Agent-Harness Viewer• Updated2 days ago • 1 • 28
Spaces citing this paper0
No Space linking this paper
Cite arxiv.org/abs/2601.02553 in a Space README.md to link it from this page.
Collections including this paper4
Similar Articles
RecMem: Recurrence-based Memory Consolidation for Efficient and Effective Long-Running LLM Agents
RecMem is a recurrence-based memory consolidation method for long-running LLM agents that reduces token consumption by up to 87% while improving accuracy, by only invoking LLMs when semantically similar interactions recur.
MemRefine: LLM-Guided Compression for Long-Term Agent Memory
MemRefine is an LLM-guided framework for compressing long-term agent memory under fixed storage budgets, using similarity for candidate pairing and an LLM judge for factual deletion/merge decisions, outperforming rule-based baselines on benchmarks.
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory
Mem0 introduces a scalable memory-centric architecture using graph-based representations to improve long-term conversational coherence in LLMs, significantly reducing latency and token costs while outperforming existing memory systems.
DimMem: Dimensional Structuring for Efficient Long-Term Agent Memory
DimMem introduces a dimensional memory framework for LLM agents that represents memories as atomic, typed units with explicit fields, achieving state-of-the-art accuracy on LoCoMo-10 and LongMemEval-S while reducing token costs by 24%.
ElasticMem: Latent Memory as a Learnable Resource for LLM Agents
ElasticMem introduces a learnable latent memory mechanism for LLM agents that adaptively allocates variable budgets to retrieved memories, improving performance on memory-intensive QA and embodied agent tasks while reducing token costs.