@geekbb: An auto-optimization tool for agent harnesses. It takes over the heavy lifting of harness optimization: you provide a benchmark command and a target repository, and it generates proposals, runs evaluations, records the results, keeps the best candidate, and discards the rest, iteratively improving the agent's prompts, configurations, and source code. https…


Summary

autoharness is a tool that optimizes agent harnesses automatically. Given a benchmark command, it generates proposals and evaluates them to improve an agent's prompts, configurations, and source code. It supports Codex and Claude.

An auto-optimization tool for agent harnesses. It takes over the heavy lifting of harness optimization: you provide a benchmark command and a target repository, and it generates proposals, runs evaluations, records the results, keeps the best candidate, and discards the rest, iteratively improving the agent's prompts, configurations, and source code. https://t.co/2qhYImGjuP https://t.co/t9qGZMZjkP
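
The propose, evaluate, record, keep-the-best cycle described above can be illustrated with a minimal Python sketch. This is not autoharness's actual implementation or API: the names `optimize`, `propose`, `evaluate`, and `Trial` are hypothetical, the candidate is a toy configuration dictionary, and the scoring function stands in for what would in practice be a run of the user-supplied benchmark command.

```python
import random
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Config = Dict[str, float]


@dataclass
class Trial:
    """One recorded evaluation (hypothetical record type)."""
    candidate: Config
    score: float
    kept: bool


def optimize(
    baseline: Config,
    propose: Callable[[Config], Config],
    evaluate: Callable[[Config], float],
    rounds: int,
) -> Tuple[Config, float, List[Trial]]:
    """Greedy loop: propose a change, score it, keep it only if it beats the best."""
    best, best_score = baseline, evaluate(baseline)
    history = [Trial(baseline, best_score, True)]
    for _ in range(rounds):
        candidate = propose(best)           # generate a proposal from the current best
        score = evaluate(candidate)         # stand-in for running the benchmark
        kept = score > best_score           # keep only strict improvements
        history.append(Trial(candidate, score, kept))  # record every result
        if kept:
            best, best_score = candidate, score
    return best, best_score, history


# Toy stand-ins (assumptions for illustration only).
def toy_evaluate(config: Config) -> float:
    # Pretend the benchmark score peaks when temperature is 0.3.
    return -(config["temperature"] - 0.3) ** 2


def make_toy_propose(seed: int) -> Callable[[Config], Config]:
    rng = random.Random(seed)

    def propose(config: Config) -> Config:
        # Perturb the single setting by a small random amount.
        return {"temperature": config["temperature"] + rng.uniform(-0.2, 0.2)}

    return propose


if __name__ == "__main__":
    best, score, history = optimize(
        {"temperature": 0.9}, make_toy_propose(0), toy_evaluate, rounds=50
    )
    print(best, score, sum(t.kept for t in history))
```

In a real harness, `evaluate` would presumably invoke the benchmark command against the modified repository and parse a score from its output, and `propose` would be a coding agent's edit rather than a random perturbation. The greedy acceptance rule here is only one possible choice.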

Cached at: 05/11/26, 12:42 PM

Built with ❤️ by Kayba and the open-source community.
