memory-evaluation

@_akhaliq: LongMINT: Evaluating Memory under Multi-Target Interference in Long-Horizon Agent Systems

X AI KOLs Following · 2026-05-21 Cached

LongMINT is a benchmark for evaluating memory under multi-target interference in long-horizon agent systems.

MEME: Multi-entity & Evolving Memory Evaluation

Hugging Face Daily Papers · 2026-05-12 Cached

The MEME benchmark evaluates AI memory systems across multiple entities and evolving conditions, revealing significant challenges in dependency reasoning that persist even with advanced retrieval techniques.
