@HuggingPapers: MemTrace: automatic error tracing for LLM memory systems Traces how memories evolve by transforming memory pipelines in…
Summary
MemTrace automatically traces errors in LLM memory systems by converting memory pipelines into executable graphs, identifying root causes of failures, and self-correcting to improve performance by up to 7.62%.
View Cached Full Text
Cached at: 05/31/26, 03:13 PM
MemTrace: automatic error tracing for LLM memory systems
Traces how memories evolve by transforming memory pipelines into executable graphs.
Automatically pinpoints root causes of failures and self-corrects to boost performance by up to 7.62%. https://t.co/yZ1RV5ZcDs
Similar Articles
@zxlzr: Introducing MemTrace: Making LLM Memory Systems Finally Debuggable Memory is becoming a core component of AI agents. Bu…
MemTrace is a new tool that makes LLM memory systems debuggable by tracing memory operations across multiple turns, addressing the black-box nature of current memory-augmented agents.
MemFail: Stress-Testing Failure Modes of LLM Memory Systems
MemFail is a diagnostic benchmark that isolates failure modes of LLM memory systems by formalizing summarization, storage, and retrieval operations, and evaluating them with adversarially designed datasets.
@hyunji_amy_lee: LLM agents & memory systems operate in continuously updated environments (Git repos, evolving docs). They must process …
MINTEval is a new benchmark for evaluating LLM agents and memory systems in continuously updated environments with frequent context changes. It shows that current systems perform poorly, with an average accuracy of 27.9% across representative systems.
MemEvoBench: Benchmarking Memory MisEvolution in LLM Agents
MemEvoBench introduces the first benchmark for evaluating memory safety in LLM agents, measuring behavioral degradation from adversarial memory injection, noisy outputs, and biased feedback across QA and workflow tasks. The work reveals that memory evolution significantly contributes to safety failures and that static defenses are insufficient.
MemPro: Agentic Memory Systems as Evolvable Programs
MemPro is a system-level evolution framework that treats the memory construction–retrieval pipeline as an evolvable program, using an Evolving Agent to iteratively diagnose failures and create improved versions. Experiments on long-horizon benchmarks show consistent improvement over static and prompt-level baselines with favorable performance–cost trade-off.