knowledge-point

Tag

Cards List
#knowledge-point

MemTrace: Probing What Final Accuracy Misses in Long-Term Memory

arXiv cs.AI · 2026-06-17 Cached

MemTrace is a benchmark that evaluates LLM agent memory at the knowledge point level, probing how facts behave under varying memory age, question type, and evidence conditions. It reveals that pooled accuracy hides distinct failure modes, and that the main bottleneck is evidence use rather than retrieval.

0 favorites 0 likes
← Back to home

Submit Feedback