Unsupervised Skill Discovery for Agentic Data Analysis
Summary
DataCOPE is an unsupervised verifier-guided skill discovery framework for data-analytic agents that derives verifier signals from exploration trajectories without labeled supervision. It improves performance by 9.71% and 32.30% on report-style and reasoning-style data analysis tasks respectively.
Similar Articles
COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation
This paper presents COLLEAGUE.SKILL, an open-source system for automatically distilling person-grounded AI skills from heterogeneous traces into inspectable, correctable, and portable skill packages, enabling LLM agents to carry bounded representations of human expertise and interaction style.
AgentCo-op: Retrieval-Based Synthesis of Interoperable Multi-Agent Workflows
AgentCo-op is a retrieval-based synthesis framework for composing interoperable multi-agent workflows from reusable skills, tools, and external agents. It uses typed artifact handoffs and bounded self-guided local repair, achieving strong results on benchmarks and enabling collaborative discovery in open-world genomics tasks.
SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents
SkillFlow introduces a benchmark of 166 tasks across 20 families for evaluating autonomous agents' ability to discover, repair, and maintain skills over time through a lifelong learning protocol. Experiments reveal a substantial capability gap among leading models, with Claude Opus 4.6 improving significantly while others show limited or negative gains from skill evolution.
Don't Retrieve, Navigate: Distilling Enterprise Knowledge into Navigable Agent Skills for QA and RAG
Corpus2Skill (C2S) is an agentic RAG system that replaces traditional vector/BM25 retrieval with a navigable skill hierarchy tree, allowing LLMs to browse enterprise knowledge directly at query time without embedding models or retrieval indexes.
SkillGen: Verified Inference-Time Agent Skill Synthesis
This article introduces SkillGen, a multi-agent framework that synthesizes and verifies reusable inference-time skills for LLM agents by contrasting successful and failed trajectories. The method ensures skills are auditable and empirically verified for their net positive impact on agent performance.