@hasantoxr: I'm replacing every memory layer I've ever built into an agent with this. SureThing dropped SOTA on LongMemEval. 88.0% …
Summary
SureThing has achieved state-of-the-art results on the LongMemEval benchmark, scoring 88.0% overall, prompting developers to replace existing memory layers in their AI agents.
Similar Articles
@yoheinakajima: ran my first benchmark this weekend (longmemeval) mostly to test activegraph, learned a lot! - this is a stepping stone…
Yohei Nakajima ran the LongMemEval benchmark on ActiveGraph, achieving 85.6% QA accuracy and 86.2% turn answer-in-context, demonstrating the effectiveness of event-based agent systems for long-term memory.
Benchmarking agent memory retrieval on LongMemEval‑S — 98% Recall@5, 100% recall by R@23, local embeddings only (all-MiniLM-L6-v2), no LLM, no API key
The author shares benchmark results for memweave, a Python library for agent memory, achieving 98% Recall@5 on LongMemEval-S using only local embeddings without LLM calls. The post details the methodology and compares performance against mempalace, highlighting stable retrieval across different question types.
I built an agent memory layer that returns a "proof tree" with every answer - what it knew, when, and why
A new hosted API memory layer for AI agents returns a proof tree with every answer, including bi-temporal versioning, audit trails, and hash verification, achieving 80.2% on LongMemEval-S with transparent benchmarks.
@simplifyinAI: Tencent just open-sourced Hy-Memory. A memory plugin that gives Al agents real long-term memory using a 6-layer framewo…
Tencent open-sourced Hy-Memory, a memory plugin for AI agents that provides long-term memory with a 6-layer dual-reasoning framework, reducing token usage by 35% and memory bloat by 70%.
MemoryOS – AI agent memory with temporal knowledge graph and 9ms ingest and 78ms retrieval
MemoryOS is an open-source, self-hosted AI agent memory tool using a temporal knowledge graph, achieving 86.2% accuracy on LongMemEval-s with fast 78ms retrieval speeds.