SIA: Self Improving AI with Harness & Weight Updates
Summary
A self-improving AI framework that simultaneously updates both model weights and task-specific agent architecture via a language-model feedback agent, achieving significant gains across legal classification, GPU optimization, and biological denoising tasks.
View Cached Full Text
Cached at: 06/08/26, 07:14 AM
Paper page - SIA: Self Improving AI with Harness & Weight Updates
Source: https://huggingface.co/papers/2605.27276
Abstract
A self-improving AI framework simultaneously updates both model weights and task-specific agent architecture through a language-model feedback agent across legal classification, GPU optimization, and biological data denoising tasks.
Humans are the bottleneck in building and improving AI. Both the models and the agents that wrap them are written, tuned, and corrected by people. The long-horizon goal of an AI that can figure out how to improve itself remains open. Two largely disjoint research lines attack this bottleneck. Theharness-updateschool has a meta-agent rewrite the scaffold of atask-specific agent(its tools, prompts, retry logic, andsearch procedure) while themodel weightsare held fixed. Thetest-time trainingschool uses hand-written RL pipelines to update the model’s own weights on task feedback while the harness is held fixed. These two silos operate in isolation. We propose SIA, aself-improving loopin which alanguage-model agent(theFeedback-Agent) updates both the harness and the weights of atask-specific agent. We evaluate across three contrasting domains: Chinese legal charge classification, low-level GPU kernel optimisation, and single-cell RNA denoising. Combining both levers outperforms scaffold iteration alone on all three benchmarks. The gains are 56.6% on LawBench, 91.9% runtime reduction on GPU kernels, and 502% on denoising over the initial baseline. Harness updates make the model agentic, shaping how it searches and acts, whileweight updatesbuild thedomain intuitionthat no prompt or scaffold can instil.
View arXiv pageView PDFProject pageGitHub754Add to collection
Get this paper in your agent:
hf papers read 2605\.27276
Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash
Models citing this paper0
No model linking this paper
Cite arxiv.org/abs/2605.27276 in a model README.md to link it from this page.
Datasets citing this paper0
No dataset linking this paper
Cite arxiv.org/abs/2605.27276 in a dataset README.md to link it from this page.
Spaces citing this paper0
No Space linking this paper
Cite arxiv.org/abs/2605.27276 in a Space README.md to link it from this page.
Collections including this paper0
No Collection including this paper
Add this paper to acollectionto link it from this page.
Similar Articles
hexo-ai/sia
SIA is a self-improving AI framework that uses a meta-agent, target agent, and feedback agent to autonomously improve performance on benchmark tasks, achieving significant gains on LawBench, GPU kernel optimization, and single-cell RNA denoising.
@rohanpaul_ai: This paper shows an AI improving itself better when it rewrites its setup and updates its model. The problem is that mo…
This paper introduces SIA, a self-improving AI loop that combines scaffold rewriting and weight updates (via LoRA) to enhance task performance. Tested on three diverse tasks, it outperforms setups using only scaffold improvements.
@AlphaSignalAI: https://x.com/AlphaSignalAI/status/2066928605691523210
The article distills 28 research papers into a 10-layer stack for building self-improving harnesses around AI models, emphasizing bounded, gated changes over general agent loops.
Continual Harness: Online Adaptation for Self-Improving Foundation Agents
The paper introduces 'Continual Harness,' a framework enabling embodied AI agents to self-improve online without environment resets. It demonstrates significant progress in playing Pokémon games, achieving human-level performance through automated prompt and skill refinement.
@qinzytech: https://x.com/qinzytech/status/2066585405479371092
A technical analysis of two approaches to building self-evolving AI agents: model-based (via architecture like SSMs or transformer with fast-weight updates, and training methods) and harness-based (via memory or meta harness that can rewrite itself). The author provides practical recommendations for different audiences.