My own local first ai harness

Reddit r/LocalLLaMA Tools

Summary

The author built TinyHarness, a low-memory-footprint AI harness compatible with Ollama, Llama.cpp, and vllm, aiming to compete with tools like pi and opencode.

Hi, i just wanted to share what im playing with for last couple weaks. I built my own AI harness: [TinyHarness](https://github.com/PTFOPlayer/TinyHarness) My main goal was low memory footprint, it is not written in Typescript/Javascript/Python, leaving as much memory as possible for running local models. Its compatible with Ollama, Llama.cpp and vllm and it can access web throught ollama web search api. The ambition is to make a competitor to tools like pi and opencode in the near future. Please roast it, i need every bit of criticism to improve it
Original Article

Similar Articles

favorite Agentic Coding Harness

Reddit r/LocalLLaMA

The author compares several agentic coding harnesses (Codex CLI, Claude Code, Gemini CLI, OpenCode, Pi) and finds Pi the leanest and best for local models, praising its simplicity and compatibility with Qwen 27B-MXFP8.

Automated AI researcher running locally with llama.cpp

Reddit r/LocalLLaMA

ml-intern is a harness for AI agents that integrates with Hugging Face's libraries and now supports running local models via llama.cpp or ollama, enabling an automated AI researcher to run 24/7 on a laptop.

best of the best agentic harnesses do this…

Reddit r/AI_Agents

The author shares insights on building effective agent harnesses: the best ones minimize LLM reliance for trivial tasks and reserve LLMs for complex reasoning, distinguishing genuine harnesses from simple wrappers.

Build a local AI coding agent from scratch

Reddit r/ArtificialInteligence

A step-by-step guide to building a minimal AI coding agent that runs entirely locally using llama.cpp, GGUF models, and a custom harness, demonstrating how to set up tools and call a model to execute real tasks like creating a landing page.