SimpleMem: Efficient Lifelong Memory for LLM Agents

Papers with Code Trending 01/05/26, 09:02 PM Papers

Summary

Introduces SimpleMem, an efficient memory framework for LLM agents that uses semantic lossless compression to improve accuracy and reduce token consumption, achieving 26.4% F1 improvement and up to 30x reduction in inference-time token usage.

To support reliable long-term interaction in complex environments, LLM agents require memory systems that efficiently manage historical experiences. Existing approaches either retain full interaction histories via passive context extension, leading to substantial redundancy, or rely on iterative reasoning to filter noise, incurring high token costs. To address this challenge, we introduce SimpleMem, an efficient memory framework based on semantic lossless compression. We propose a three-stage pipeline designed to maximize information density and token utilization: (1) Semantic Structured Compression, which applies entropy-aware filtering to distill unstructured interactions into compact, multi-view indexed memory units; (2) Recursive Memory Consolidation, an asynchronous process that integrates related units into higher-level abstract representations to reduce redundancy; and (3) Adaptive Query-Aware Retrieval, which dynamically adjusts retrieval scope based on query complexity to construct precise context efficiently. Experiments on benchmark datasets show that our method consistently outperforms baseline approaches in accuracy, retrieval efficiency, and inference cost, achieving an average F1 improvement of 26.4% while reducing inference-time token consumption by up to 30-fold, demonstrating a superior balance between performance and efficiency. Code is available at https://github.com/aiming-lab/SimpleMem.

Original Article

View Cached Full Text

Cached at: 05/24/26, 12:26 AM

Paper page - SimpleMem: Efficient Lifelong Memory for LLM Agents

Source: https://huggingface.co/papers/2601.02553

Abstract

Tosupportreliablelong-terminteractionincomplexenvironments,LLMagentsrequirememorysystemsthatefficientlymanagehistoricalexperiences.Existingapproacheseitherretainfullinteractionhistoriesviapassivecontextextension,leadingtosubstantialredundancy,orrelyoniterativereasoningtofilternoise,incurringhightokencosts.Toaddressthischallenge,weintroduceSimpleMem,anefficientmemoryframeworkbasedonsemanticlosslesscompression.Weproposeathree-stagepipelinedesignedtomaximizeinformationdensityandtokenutilization:(1)SemanticStructuredCompression,whichappliesentropy-awarefilteringtodistillunstructuredinteractionsintocompact,multi-viewindexedmemoryunits;(2)RecursiveMemoryConsolidation,anasynchronousprocessthatintegratesrelatedunitsintohigher-levelabstractrepresentationstoreduceredundancy;and(3)AdaptiveQuery-AwareRetrieval,whichdynamicallyadjustsretrievalscopebasedonquerycomplexitytoconstructprecisecontextefficiently.Experimentsonbenchmarkdatasetsshowthatourmethodconsistentlyoutperformsbaselineapproachesinaccuracy,retrievalefficiency,andinferencecost,achievinganaverageF1improvementof26.4%whilereducinginference-timetokenconsumptionbyupto30-fold,demonstratingasuperiorbalancebetweenperformanceandefficiency.Codeisavailableathttps://github.com/aiming-lab/SimpleMem.

View arXiv page View PDF Project page GitHub3.39k Add to collection

Get this paper in your agent:

hf papers read 2601\.02553

Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash

Models citing this paper0

No model linking this paper

Cite arxiv.org/abs/2601.02553 in a model README.md to link it from this page.

Datasets citing this paper2

#### molmohsen/awesome-ai-agent-papers #### zhongweixie/A-Survey-on-AI-Agent-Harness Viewer• Updated2 days ago • 1 • 28

Spaces citing this paper0

No Space linking this paper

Cite arxiv.org/abs/2601.02553 in a Space README.md to link it from this page.

SimpleMem: Efficient Lifelong Memory for LLM Agents

Paper page - SimpleMem: Efficient Lifelong Memory for LLM Agents

Abstract

Models citing this paper0

Datasets citing this paper2

Spaces citing this paper0

Collections including this paper4

Similar Articles

RecMem: Recurrence-based Memory Consolidation for Efficient and Effective Long-Running LLM Agents

MemRefine: LLM-Guided Compression for Long-Term Agent Memory

Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

DimMem: Dimensional Structuring for Efficient Long-Term Agent Memory

ElasticMem: Latent Memory as a Learnable Resource for LLM Agents

Submit Feedback

Similar Articles

RecMem: Recurrence-based Memory Consolidation for Efficient and Effective Long-Running LLM Agents

MemRefine: LLM-Guided Compression for Long-Term Agent Memory

Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

DimMem: Dimensional Structuring for Efficient Long-Term Agent Memory

ElasticMem: Latent Memory as a Learnable Resource for LLM Agents