long-term-memory

Tag

Cards List
#long-term-memory

MEMPROBE: Probing Long-Term Agent Memory via Hidden User-State Recovery

arXiv cs.CL · yesterday Cached

MEMPROBE is a benchmark that evaluates long-term memory in LLM agents by reconstructing hidden user states from the agent's memory after interaction.

0 favorites 0 likes
#long-term-memory

@yoheinakajima: ActiveGraph: 1 month in: Paper #1: The Log is the Agent 3 LongMemEval Experiments Paper #2: Regimes, self-improvement l…

X AI KOLs Following · yesterday Cached

ActiveGraph announces two new papers on agent memory (LongMemEval) and self-improvement regimes, along with reference agents, pack templates, and upcoming meetups in Seattle and San Francisco.

0 favorites 0 likes
#long-term-memory

@dair_ai: Great paper on long-term memory for LLM agents. (bookmark it) Coarse summaries drift and unconstrained updates corrupt,…

X AI KOLs Following · 5d ago Cached

AtomMem introduces a long-term memory system for LLM agents that uses atomic facts as efficient memory units, organizing them into hierarchical event structures and temporal user profiles, achieving state-of-the-art on the LoCoMo benchmark.

0 favorites 0 likes
#long-term-memory

We built a persistent agent memory layer on Elasticsearch with 0.89 recall

Hacker News Top · 6d ago Cached

Elasticsearch blog post describes building a persistent agent memory layer with three memory types (episodic, semantic, procedural), achieving 0.89 recall on a QA eval with zero tenant leaks using hybrid recall and DLS isolation.

0 favorites 0 likes
#long-term-memory

CoreMem: Riemannian Retrieval and Fisher-Guided Distillation for Long-Term Memory in Dialogue Agents

arXiv cs.CL · 2026-06-18 Cached

CoreMem proposes a resource-efficient edge-cloud memory architecture for dialogue agents, using Riemannian retrieval with a Fisher-Rao metric and Fisher-guided discrete token distillation to achieve strong accuracy improvements within an 8 GB VRAM budget.

0 favorites 0 likes
#long-term-memory

MemTrace: Probing What Final Accuracy Misses in Long-Term Memory

arXiv cs.AI · 2026-06-17 Cached

MemTrace is a benchmark that evaluates LLM agent memory at the knowledge point level, probing how facts behave under varying memory age, question type, and evidence conditions. It reveals that pooled accuracy hides distinct failure modes, and that the main bottleneck is evidence use rather than retrieval.

0 favorites 0 likes
#long-term-memory

T-Mem: Memory That Anticipates, Not Archives

arXiv cs.CL · 2026-06-16 Cached

T-Mem is a new long-term conversational memory architecture that enables both descriptive and associative recall, covering scenarios where query and memory share surface features and those where they are connected by latent semantic arcs. It reaches state-of-the-art on the LoCoMo and LoCoMo-Plus benchmarks.

0 favorites 0 likes
#long-term-memory

@simplifyinAI: Tencent just open-sourced Hy-Memory. A memory plugin that gives Al agents real long-term memory using a 6-layer framewo…

X AI KOLs Timeline · 2026-06-12 Cached

Tencent open-sourced Hy-Memory, a memory plugin for AI agents that provides long-term memory with a 6-layer dual-reasoning framework, reducing token usage by 35% and memory bloat by 70%.

0 favorites 0 likes
#long-term-memory

BEAM benchmarks

Reddit r/AI_Agents · 2026-06-12

Midas achieves 0.56 recall@k on BEAM 100K and 0.51 on BEAM 500K with zero LLM calls and zero cost, demonstrating efficient long-term memory for agents.

0 favorites 0 likes
#long-term-memory

MemRefine: LLM-Guided Compression for Long-Term Agent Memory

arXiv cs.CL · 2026-06-12 Cached

MemRefine is an LLM-guided framework for compressing long-term agent memory under fixed storage budgets, using similarity for candidate pairing and an LLM judge for factual deletion/merge decisions, outperforming rule-based baselines on benchmarks.

0 favorites 0 likes
#long-term-memory

Infini Memory: Maintainable Topic Documents for Long-Term LLM Agent Memory

arXiv cs.AI · 2026-06-10 Cached

Introduces Infini Memory, a maintainable text-based persistent memory architecture for LLM agents that uses topic-structured documents and iterative retrieval to improve long-term memory usage, achieving 64.7% on MemoryAgentBench.

0 favorites 0 likes
#long-term-memory

REAL: A Reasoning-Enhanced Graph Framework for Long-Term Memory Management of LLMs

arXiv cs.CL · 2026-06-10 Cached

REAL is a reasoning-enhanced graph framework for long-term memory management of LLMs that uses temporal and confidence-aware directed property graphs with non-destructive temporal updates and hybrid beam search retrieval, achieving an average improvement of 22.72%.

0 favorites 0 likes
#long-term-memory

Is an AI 'memory manager' that decides what to keep/forget actually feasible?

Reddit r/ArtificialInteligence · 2026-06-05

A user questions the feasibility of an AI memory manager system that decides what to keep or forget based on importance, reinforcement, and decay.

0 favorites 0 likes
#long-term-memory

Training-Free Lexical-Dense Fusion for Conversational-Memory Retrieval

arXiv cs.LG · 2026-06-04 Cached

This paper proposes a training-free, CPU-only retrieval method that fuses BM25 lexical scores with late-interaction dense scores for conversational memory retrieval, achieving up to +17.2 points improvement on LoCoMo Hit@1 over late interaction alone across six encoders. The study provides controlled ablations on pooling operators, reranker effects, and benchmark robustness, framing the gain as a division of labor between dense and lexical signals.

0 favorites 0 likes
#long-term-memory

LifeSide: Benchmarking Agents as Lifelong Digital Companions

arXiv cs.CL · 2026-06-04 Cached

LifeSide is a new benchmark for evaluating AI agents as lifelong digital companions, testing memory tracking, user understanding, privacy control, and emotional companionship across 2,000 personas and 111K tasks in multi-session settings. Results show that even top models fail to sustain accurate user understanding and genuine companionship over long horizons.

0 favorites 0 likes
#long-term-memory

SubtleMemory: A Benchmark for Fine-Grained Relational Memory Discrimination in Long-Horizon AI Agents

Hugging Face Daily Papers · 2026-06-04 Cached

SubtleMemory is a benchmark for evaluating AI agents' fine-grained relational memory discrimination in long-horizon interactions, consisting of 1,522 instances over 10 long histories. It reveals limitations in current memory systems for preserving and utilizing nuanced memory relationships.

0 favorites 0 likes
#long-term-memory

@garrytan: GBrain SkillOpt now has 4 E2E evals that verify it working https://github.com/garrytan/gbrain-evals/blob/main/docs/benc…

X AI KOLs Following · 2026-06-03 Cached

Garry Tan's gbrain-evals is an open-source test suite for gbrain, an AI agent's long-term memory, with 4 end-to-end evaluations verifying SkillOpt functionality, achieving high recall and precision on multiple benchmarks.

0 favorites 0 likes
#long-term-memory

@wsl8297: When running complex tasks with AI agents, the most painful thing is often not that the model isn't strong enough, but that as the conversation gets longer, the context starts to overflow. You have to keep filling in background details, re-explaining the process, plus the redundant logs from tool calls — tokens just gush out like a broken pipe. Recently, I saw TencentDB Agent Memory open-sourced by Tencent...

X AI KOLs Timeline · 2026-06-03 Cached

Tencent has open-sourced TencentDB Agent Memory, which solves the AI agent long-context overflow problem through hierarchical memory management (symbolic short-term memory + hierarchical long-term memory). Benchmarks show token consumption reduced by up to 61% and task success rate improved by over 50%.

0 favorites 0 likes
#long-term-memory

@MaxForAI: Yesterday, ByteDance Seed open-sourced a very interesting checkpoint, TaskMem. It is trained on Qwen3-VL-30B-A3B, with the goal not being to directly answer questions, but to enable multimodal Agents to learn to generate more useful long-term memory from video/environment streams. The key is to let the Agent learn in continuous video…

X AI KOLs Timeline · 2026-06-03 Cached

ByteDance Seed has open-sourced the TaskMem checkpoint, trained on Qwen3-VL-30B-A3B. It uses two-stage reinforcement learning to enable multimodal Agents to learn to generate long-term memory from video streams, achieving significant improvements on benchmarks such as VideoMME and EgoLife.

0 favorites 0 likes
#long-term-memory

Agentic AI memory isn't a hoarding problem. It's a pruning problem.

Reddit r/AI_Agents · 2026-06-03

The author argues that AI agent memory should focus on pruning data rather than hoarding, drawing parallels to human memory types (sensory, short-term, long-term) and suggesting that modeling after human memory can reduce token usage while maintaining high-quality context.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback