My own local-first AI harness
Summary
The author built TinyHarness, an AI harness with a low memory footprint that works with Ollama, llama.cpp, and vLLM, and aims to compete with tools like Pi and OpenCode.
Similar Articles
@_vmlops: This is the best site on the internet to learn harness engineering https://walkinglabs.github.io/learn-harness-engineer…
A comprehensive course teaching harness engineering for AI coding agents, covering environment design, state management, and verification to make agentic coding tools like Codex and Claude Code more reliable.
favorite Agentic Coding Harness
The author compares several agentic coding harnesses (Codex CLI, Claude Code, Gemini CLI, OpenCode, Pi) and finds Pi the leanest and best for local models, praising its simplicity and compatibility with Qwen 27B-MXFP8.
Automated AI researcher running locally with llama.cpp
ml-intern is a harness for AI agents that integrates with Hugging Face's libraries and now supports local models via llama.cpp or Ollama, so an automated AI researcher can run 24/7 on a laptop.
best of the best agentic harnesses do this…
The author shares insights on building effective agent harnesses: the best ones minimize LLM reliance for trivial tasks and reserve LLMs for complex reasoning, distinguishing genuine harnesses from simple wrappers.
Build a local AI coding agent from scratch
A step-by-step guide to building a minimal AI coding agent that runs entirely locally using llama.cpp, GGUF models, and a custom harness. It shows how to set up tools and call a model to carry out real tasks such as creating a landing page.