memory-management

#memory-management

Context Recycling for Long-Horizon LLM Inference

arXiv cs.CL ↗ · 2h ago Cached

This paper introduces ContextForge, a hierarchical memory architecture that treats the LLM context window as a recyclable workspace, achieving significant token and speed improvements on long-horizon tasks while maintaining accuracy on a 276-million-row enterprise benchmark.

0 favorites 0 likes

#memory-management

The Garbage Collection Handbook: The Art of Automatic Memory Management (2nd Ed)

Hacker News Top ↗ · 7h ago Cached

The second edition of 'The Garbage Collection Handbook' updates the authoritative reference on automatic memory management, covering new hardware, software, and algorithm developments.

0 favorites 0 likes

#memory-management

@wquguru: https://x.com/wquguru/status/2069641926752780384

X AI KOLs Timeline ↗ · 2d ago Cached

This article comprehensively reviews the complete architectural layering of AI Agent Memory as of mid-2026, including rule files, persistent profiles, historical recall, and evidence chains. It explains the storage methods, loading timings, and governance principles of different memory layers, emphasizing the key role of memory in helping agents achieve cross-session compounding work.

0 favorites 0 likes

#memory-management

A game where you're an OS and have to manage processes, memory and I/O events

Hacker News Top ↗ · 2d ago Cached

A game where you play as the operating system, managing processes, memory, and I/O events to keep the user satisfied.

0 favorites 0 likes

#memory-management

Question: how should Hermes agents handle persistent memory across sessions?

Reddit r/AI_Agents ↗ · 2026-06-18

A community discussion about how Hermes agents should handle persistent memory across sessions, exploring an external memory layer (8mem) and comparing memory-aware vs generic outputs.

0 favorites 0 likes

#memory-management

@DODOREACH: https://x.com/DODOREACH/status/2067571689634894251

X AI KOLs Timeline ↗ · 2026-06-18 Cached

After three months using Hermes Agent, the author shares insights on memory management and profile optimization, finding that less memory and fewer profiles lead to better results.

0 favorites 0 likes

#memory-management

llama.cpp - how to free up even more space on your GPU

Reddit r/LocalLLaMA ↗ · 2026-06-17

A thread sharing practical tips for freeing up GPU memory in llama.cpp, such as offloading mmproj to CPU and adjusting KV cache types, while discussing parameters like --cache-type-k/v and --spec-draft-n-max.

0 favorites 0 likes

#memory-management

I gave my AI agents a shared memory via MCP — here's how

Reddit r/AI_Agents ↗ · 2026-06-16

Nexus Memory is an MCP-native memory server that allows AI agents to share context via a unified protocol, enabling persistent and coordinated memory across different agents without custom integration.

0 favorites 0 likes

#memory-management

GeneralVLA-2: Geometry-Aware Reconstruction and Governed Memory for Robot Planning

Hugging Face Daily Papers ↗ · 2026-06-16 Cached

GeneralVLA-2 introduces GeoFuse-MV3D for improved 3D reconstruction and a governed KnowledgeBank for better memory management in robotic manipulation tasks, achieving performance gains on several benchmarks.

0 favorites 0 likes

#memory-management

OPD-Evolver: Cultivating Holistic Agent Evolver via On-Policy Distillation

Hugging Face Daily Papers ↗ · 2026-06-16 Cached

OPD-Evolver proposes a self-evolving agent framework using slow-fast co-evolution and on-policy self-distillation to enhance memory management and policy learning, outperforming existing methods like ReasoningBank and Skill0 across multi-domain benchmarks.

0 favorites 0 likes

#memory-management

@zaimiri: https://x.com/zaimiri/status/2066117404392890835

X AI KOLs Timeline ↗ · 2026-06-14 Cached

A detailed 7-day guide for setting up the Hermes AI agent, covering identity, memory, tools, and Telegram integration.

0 favorites 0 likes

#memory-management

Python 3.14 garbage collection rigamarole

Hacker News Top ↗ · 2026-06-12 Cached

Python 3.14 introduced an incremental garbage collector that was later reverted in 3.14.5 due to memory pressure reports. This article explains the changes, their impact, and the controversy around the revert.

0 favorites 0 likes

#memory-management

G-Long: Graph-Enhanced Memory Management for Efficient Long-Term Dialogue Agents

arXiv cs.CL ↗ · 2026-06-12 Cached

G-Long proposes a graph-enhanced memory management framework for long-term dialogue agents, using a fine-tuned small language model for structured triplet extraction and associative retrieval, achieving state-of-the-art performance in response generation and memory retrieval with reduced computational overhead.

0 favorites 0 likes

#memory-management

How do i prevent llama.cpp from offloading on Swap?

Reddit r/LocalLLaMA ↗ · 2026-06-11

User seeks advice on preventing llama.cpp from offloading KV cache to swap before RAM is fully exhausted, sharing their configuration on an M2 Max with 96GB RAM and a large Qwen model.

0 favorites 0 likes

#memory-management

Beyond Compaction: Structured Context Eviction for Long-Horizon Agents

arXiv cs.CL ↗ · 2026-06-11 Cached

Introduces Context Window Lifecycle (CWL), a structured context eviction scheme for long-horizon LLM agents that maintains an effectively unbounded working horizon by evicting content based on a dependency graph, avoiding the limitations of summarization-based compaction and recency truncation.

0 favorites 0 likes

#memory-management

Unix GC Remastered

Hacker News Top ↗ · 2026-06-10 Cached

A detailed walkthrough of the Linux kernel's AF_UNIX garbage collector rewrite, explaining the background, the new graph-based model, and a Use-After-Free bug.

0 favorites 0 likes

#memory-management

REAL: A Reasoning-Enhanced Graph Framework for Long-Term Memory Management of LLMs

arXiv cs.CL ↗ · 2026-06-10 Cached

REAL is a reasoning-enhanced graph framework for long-term memory management of LLMs that uses temporal and confidence-aware directed property graphs with non-destructive temporal updates and hybrid beam search retrieval, achieving an average improvement of 22.72%.

0 favorites 0 likes

#memory-management

Are bigger context windows actually the wrong direction for agents?

Reddit r/AI_Agents ↗ · 2026-06-10

The author questions whether the focus on expanding context windows for AI agents is counterproductive, arguing that accumulated junk slows down long sessions and suggests keeping working context small with external memory.

0 favorites 0 likes

#memory-management

zalloc: Use zig allocators in your c code

Lobsters Hottest ↗ · 2026-06-08 Cached

zalloc replaces malloc, calloc, realloc, and free in C modules with Zig allocators, enabling Zig-style memory management in C code.

0 favorites 0 likes

#memory-management

Is an AI 'memory manager' that decides what to keep/forget actually feasible?

Reddit r/ArtificialInteligence ↗ · 2026-06-05

A user questions the feasibility of an AI memory manager system that decides what to keep or forget based on importance, reinforcement, and decay.

0 favorites 0 likes

memory-management

Submit Feedback