@Sumanth_077: Let Agents Design Agents! Memento-Skills is a self-evolving agent framework where agents learn from failures and rewrit…
Summary
Memento-Skills is a self-evolving agent framework where agents learn from failures and rewrite their own skills, improving over time through a Read-Execute-Reflect-Write loop. It was tested on HLE and GAIA benchmarks and supports open-source LLMs like Kimi, MiniMax, and GLM.
View Cached Full Text
Cached at: 06/10/26, 05:47 AM
Let Agents Design Agents!
Memento-Skills is a self-evolving agent framework where agents learn from failures and rewrite their own skills.
Most agent frameworks treat skills as static. You write them once, load them into context, and hope they work. When they fail, you debug manually or try again with the same broken skill.
Memento-Skills takes a different approach. When a skill fails, the system reflects on why it failed, locates the broken skill, rewrites it, and stores the improved version back into the skill library.
Here’s how it works:
The framework runs a continuous Read → Execute → Reflect → Write loop.
Read: Retrieve candidate skills from the local library instead of loading every skill into context.
Execute: Run skills in a local sandbox with actual tool calling - file operations, web search, scripts, external systems.
Reflect: When execution fails, the system records what went wrong, updates the skill’s utility score, and attributes the failure to specific skills.
Write: Rewrite broken skills, optimize weak ones, or create new skills when nothing suitable exists.
This isn’t about accumulating more skills. It’s about building a skill library that improves through task experience.
The system was tested on HLE (Humanity’s Last Exam) and GAIA (General AI Assistants) benchmarks. Performance improved over multiple learning rounds as the skill library grew from basic atomic skills into a richer set of learned capabilities.
Built for open-source LLM ecosystems - works with Kimi, MiniMax, GLM, and other OpenAI-compatible endpoints.
Comes with 9 built-in skills (filesystem, web-search, PDF, docx, xlsx, pptx, image analysis, skill-creator, dependency install) that serve as the starting point for the evolving library.
It’s 100% open source
Link to Memento-Skills in comments!
Similar Articles
MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation
MUSE-Autoskill proposes a skill-centric agent framework that enables LLM agents to continuously create, reuse, and refine skills through a unified lifecycle of creation, memory, management, evaluation, and refinement. Experiments on SkillsBench show that lifecycle-managed skills improve task success, efficiency, reuse, and cross-agent transfer.
SkillMaster: Toward Autonomous Skill Mastery in LLM Agents
This paper introduces SkillMaster, a training framework that enables LLM agents to autonomously create, refine, and select skills through trajectory-informed review and counterfactual utility evaluation.
OpenSkill: Open-World Self-Evolution for LLM Agents
OpenSkill is a framework for LLM agents to self-evolve skills and verification signals from open-world resources without target-task supervision, achieving high performance across benchmarks.
Experience Makes Skillful: Enabling Generalizable Medical Agent Reasoning via Self-Evolving Skill Memory
This paper introduces SkeMex, a self-evolving framework that enhances medical agents by distilling interaction trajectories into structured skill memory, enabling better long-term clinical reasoning through context-dependent utility estimation and governance.
SkillGen: Verified Inference-Time Agent Skill Synthesis
This article introduces SkillGen, a multi-agent framework that synthesizes and verifies reusable inference-time skills for LLM agents by contrasting successful and failed trajectories. The method ensures skills are auditable and empirically verified for their net positive impact on agent performance.