EvoDS: Self-Evolving Autonomous Data Science Agent with Skill Learning and Context Management
Summary
EvoDS is a self-evolving autonomous data science agent that improves via reinforcement learning-driven skill acquisition and adaptive context compression, outperforming open-source agents by 28.9% on benchmarks.
View Cached Full Text
Cached at: 06/05/26, 06:07 AM
Paper page - EvoDS: Self-Evolving Autonomous Data Science Agent with Skill Learning and Context Management
Source: https://huggingface.co/papers/2606.03841
Abstract
EvoDS introduces a self-evolving autonomous data science agent that enhances its capabilities through skill acquisition and adaptive context management via reinforcement learning.
Recent progress in Large Language Model (LLM) agents has enabled promising advances in automated data science. However, existing approaches remain fundamentally limited by their static action sets and lack of principled long-horizon context management, hindering their ability to accumulate reusable experience across tasks and operate reliably in multi-stage, iterative data science pipelines. To address these challenges, we introduce EvoDS, a self-evolving autonomous data science agent that learns to expand its skills and adaptively managing long-term context throughagentic reinforcement learning. Specifically, EvoDS introduces two key strategies: (1)Autonomous Skill Acquisition(ASA) mechanism, which enables agents to synthesize, validate, and reuse executable skills; and (2)Adaptive Context Compression(ACC) strategy, which treats context management as a learned control problem rather than passive truncation. These strategies are orchestrated within a two-stagemulti-agent trainingscheme, enabling EvoDS to autonomously improve over time. Theoretically, we prove that EvoDS’s hierarchical design reduces tool-selection error, and its optimization objective aligns with aninformation bottleneck principle, ensuring efficient context use. Empirically, EvoDS outperforms state-of-the-art open-source data science agents by an average of 28.9% across four diverse benchmarks while eliminating out-of-token failures. Our code and data are available at https://github.com/usail-hkust/EvoDS.
View arXiv pageView PDFProject pageGitHub1Add to collection
Get this paper in your agent:
hf papers read 2606\.03841
Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash
Models citing this paper0
No model linking this paper
Cite arxiv.org/abs/2606.03841 in a model README.md to link it from this page.
Datasets citing this paper0
No dataset linking this paper
Cite arxiv.org/abs/2606.03841 in a dataset README.md to link it from this page.
Spaces citing this paper0
No Space linking this paper
Cite arxiv.org/abs/2606.03841 in a Space README.md to link it from this page.
Collections including this paper0
No Collection including this paper
Add this paper to acollectionto link it from this page.
Similar Articles
EvoMaster: A Foundational Agent Framework for Building Evolving Autonomous Scientific Agents at Scale
EvoMaster is a scalable, self-evolving agent framework for large-scale scientific discovery that enables iterative hypothesis refinement and knowledge accumulation across experimental cycles. It achieves state-of-the-art results on four benchmarks including Humanity's Last Exam (41.1%) and MLE-Bench Lite (75.8%), outperforming general-purpose baselines by up to 316%.
EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery
EvoScientist is an adaptive multi-agent framework for end-to-end scientific discovery that continuously improves through persistent memory modules, comprising three specialized agents for idea generation, experiment execution, and knowledge distillation. It outperforms 7 state-of-the-art systems in scientific idea generation and improves code execution success rates through multi-agent evolution.
@tom_doerr: Semi-autonomous agents optimize codebases through parallel experimentation https://github.com/evo-hq/evo
Evo is an open-source tool that provides semi-autonomous agents to optimize codebases through parallel experimentation, using tree search and multiple subagents to autonomously discover and improve metrics.
EvoSci: A Bio-Inspired Multi-Agent Framework for the Evolution of Scientific Discovery
EvoSci proposes a bio-inspired multi-agent framework that integrates evolutionary algorithms with knowledge graph modeling to iteratively generate, evaluate, and refine research ideas, achieving top performance in peer-review evaluations.
@tom_doerr: Automates research workflows with persistent multi-agent memory https://github.com/EvoScientist/EvoScientist…
EvoScientist is an open-source framework that automates research workflows using self-evolving AI scientists with persistent multi-agent memory, adopting a human-on-the-loop paradigm for autonomous research exploration and insight generation.