@hasantoxr: I'm replacing every memory layer I've ever built into an agent with this. SureThing dropped SOTA on LongMemEval. 88.0% …

X AI KOLs Timeline 05/12/26, 02:16 PM Tools

Summary

SureThing has achieved state-of-the-art results on the LongMemEval benchmark, scoring 88.0% overall, prompting developers to replace existing memory layers in their AI agents.

I'm replacing every memory layer I've ever built into an agent with this. SureThing dropped SOTA on LongMemEval. 88.0% overall. 91.0% knowledge update. 76.7% single-session preference. Number one across every category that actually matters. Then their own AI walked up to the screen and started explaining the whole thing itself. Nobody asked it to.

Original Article

Similar Articles

@yoheinakajima: ran my first benchmark this weekend (longmemeval) mostly to test activegraph, learned a lot! - this is a stepping stone…

X AI KOLs Timeline

Yohei Nakajima ran the LongMemEval benchmark on ActiveGraph, achieving 85.6% QA accuracy and 86.2% turn answer-in-context, demonstrating the effectiveness of event-based agent systems for long-term memory.

Benchmarking agent memory retrieval on LongMemEval‑S — 98% Recall@5, 100% recall by R@23, local embeddings only (all-MiniLM-L6-v2), no LLM, no API key

Reddit r/AI_Agents

The author shares benchmark results for memweave, a Python library for agent memory, achieving 98% Recall@5 on LongMemEval-S using only local embeddings without LLM calls. The post details the methodology and compares performance against mempalace, highlighting stable retrieval across different question types.

I built an agent memory layer that returns a "proof tree" with every answer - what it knew, when, and why

Reddit r/AI_Agents

A new hosted API memory layer for AI agents returns a proof tree with every answer, including bi-temporal versioning, audit trails, and hash verification, achieving 80.2% on LongMemEval-S with transparent benchmarks.

@simplifyinAI: Tencent just open-sourced Hy-Memory. A memory plugin that gives Al agents real long-term memory using a 6-layer framewo…

X AI KOLs Timeline

Tencent open-sourced Hy-Memory, a memory plugin for AI agents that provides long-term memory with a 6-layer dual-reasoning framework, reducing token usage by 35% and memory bloat by 70%.

MemoryOS – AI agent memory with temporal knowledge graph and 9ms ingest and 78ms retrieval

Reddit r/AI_Agents

MemoryOS is an open-source, self-hosted AI agent memory tool using a temporal knowledge graph, achieving 86.2% accuracy on LongMemEval-s with fast 78ms retrieval speeds.

Similar Articles

@yoheinakajima: ran my first benchmark this weekend (longmemeval) mostly to test activegraph, learned a lot! - this is a stepping stone…

Benchmarking agent memory retrieval on LongMemEval‑S — 98% Recall@5, 100% recall by R@23, local embeddings only (all-MiniLM-L6-v2), no LLM, no API key

I built an agent memory layer that returns a "proof tree" with every answer - what it knew, when, and why

@simplifyinAI: Tencent just open-sourced Hy-Memory. A memory plugin that gives Al agents real long-term memory using a 6-layer framewo…

MemoryOS – AI agent memory with temporal knowledge graph and 9ms ingest and 78ms retrieval

Submit Feedback