self-evolving-agents

Tag

Cards List
#self-evolving-agents

Scaling Self-Evolving Agents via Parametric Memory

arXiv cs.AI · 2d ago Cached

Researchers from Alibaba/Qwen and Peking University introduce TMEM, a self-evolving parametric memory framework that uses online LoRA weight updates to let LLM agents genuinely learn from experience within a single episode, rather than relying solely on prompt-space memory. TMEM outperforms summary-based and retrieval-based baselines across multiple benchmarks including LoCoMo, LongMemEval-S, and CL-Bench.

0 favorites 0 likes
#self-evolving-agents

EVE-Agent: Evidence-Verifiable Self-Evolving Agents

arXiv cs.AI · 2026-05-25 Cached

EVE-Agent introduces a framework for self-evolving search agents that ensure evidence verifiability by generating questions, answers, and evidence spans, and training on marginal accuracy gain of evidence. This improves grounded correctness without human annotations.

0 favorites 0 likes
#self-evolving-agents

Rethinking Experience Utilization in Self-Evolving Language Model Agents

arXiv cs.CL · 2026-05-11 Cached

This paper introduces ExpWeaver, a framework that optimizes how self-evolving language model agents utilize past experiences during runtime decision-making. It demonstrates that selectively invoking experience based on reasoning uncertainty improves performance across various environments and models.

0 favorites 0 likes
#self-evolving-agents

Knowledge-Graph Paths as Intermediate Supervision for Self-Evolving Search Agents

arXiv cs.AI · 2026-05-08 Cached

This paper introduces a method using knowledge-graph paths as intermediate supervision to improve self-evolving search agents. It addresses bottlenecks in Search Self-Play by grounding question construction in relational context and introducing a Waypoint Coverage Reward for graded partial credit.

0 favorites 0 likes
#self-evolving-agents

SkillOS: Learning Skill Curation for Self-Evolving Agents

Hugging Face Daily Papers · 2026-05-07 Cached

This paper introduces SkillOS, a reinforcement learning framework that enables LLM agents to learn long-term skill curation policies for self-evolution, improving performance and generalization across tasks.

0 favorites 0 likes
#self-evolving-agents

On Safety Risks in Experience-Driven Self-Evolving Agents

arXiv cs.CL · 2026-04-21 Cached

Researchers from Harbin Institute of Technology and Singapore Management University investigate safety risks in experience-driven self-evolving LLM agents, finding that even benign task experience can compromise safety in high-risk scenarios due to agents' execution-oriented tendencies, and revealing a fundamental safety–utility trade-off.

0 favorites 0 likes
← Back to home

Submit Feedback