Tag
该论文提出 FluxMem,一种将智能体记忆视为不断演化的图结构,通过动态修复连接和提炼技能来提升记忆效果的系统。实验显示其在多个任务上优于现有方法,例如在 LoCoMo 上达到 95.06% 准确率,并在 GAIA 上相比 Kimi K2 提升 12.73 分。
ActiveGraph provides a persistent, replayable world for long-running agents using an append-only event log, enabling fork, diff, and lineage capabilities.
GraphARC is a new benchmark for abstract reasoning on graph-structured data, extending the ARC paradigm to graphs. Evaluations of state-of-the-art language models reveal a comprehension-execution gap and performance degradation on larger instances, highlighting scaling challenges.