skill-curation

#skill-curation

Not All Skills Help: Measuring and Repairing Agent Knowledge

arXiv cs.CL ↗ · 5d ago Cached

This paper identifies that naive skill accumulation in LLM agents can cause performance regressions, as skills beneficial for some tasks hurt others. The authors propose Assay, a framework that measures per-skill causal contributions and applies per-task masking, achieving state-of-the-art results on AppWorld and τ-bench without weight updates.

0 favorites 0 likes

#skill-curation

Google's SkillOS for Self-Evolving AI Agents (22 minute read)

TLDR AI ↗ · 2026-05-11 Cached

Google Cloud AI Research introduces SkillOS, a reinforcement learning framework enabling LLM-based agents to self-evolve by curating reusable skills from past experiences.

0 favorites 0 likes

#skill-curation

SkillOS: Learning Skill Curation for Self-Evolving Agents

Hugging Face Daily Papers ↗ · 2026-05-07 Cached

This paper introduces SkillOS, a reinforcement learning framework that enables LLM agents to learn long-term skill curation policies for self-evolution, improving performance and generalization across tasks.

0 favorites 0 likes

skill-curation

Not All Skills Help: Measuring and Repairing Agent Knowledge

Google's SkillOS for Self-Evolving AI Agents (22 minute read)

SkillOS: Learning Skill Curation for Self-Evolving Agents

Submit Feedback