Feral v0.2.0 - open-source local AI workspace (llama.cpp + BYOK + agent runtime), now on Windows, macOS and Linux. No telemetry, no subscription, MIT/Apache-2.0

Reddit r/AI_Agents Tools

Summary

Feral v0.2.0 is an open-source local AI workspace that runs GGUF models via llama.cpp, supports BYOK for cloud models, includes an agent runtime with sandboxed tools and a knowledge graph, and now ships on Windows, macOS, and Linux with no telemetry or subscription.

I've been building Feral solo for the past few months. It's a desktop app for running AI on your own machine, and v0.2.0 just shipped with macOS and Linux support, so it felt like the right time to share it here.

# What it is:

- Local GGUF models via llama.cpp: fully offline chat, nothing leaves your machine
- BYOK for cloud models (OpenAI, Anthropic, Gemini, NVIDIA NIM, etc.): your key, your bill, no proxy in between. Keys live in the OS keychain, never in the frontend
- An agent runtime with sandboxed tool use (file ops, shell with env blocklist + output caps, web research), a skill system, and a persistent memory knowledge graph you can actually inspect and edit in a graph UI
- MCP support: an app-store style page for Model Context Protocol servers, with one-click install
- Vision (paste/drop screenshots) and any-file attachments (PDF/Office parsed natively)
- Tauri 2 + Rust, so the installer is small and it's not another Electron app

# Honest state of things:

- Windows is the primary, most-tested platform
- macOS and Linux are fresh this release: CI-built, lightly tested on real hardware. Consider them beta
- macOS isn't notarized yet (no Apple Developer cert, it's a free open-source project). First launch needs `xattr -cr /Applications/Feral.app`, and updates may trigger a Keychain permission prompt for your saved API keys. Both are documented in the README
- Linux ships as .deb/.rpm without auto-update for now (AppImage had bundling issues, deferred to the next release)
- Local inference is text-only for now; vision needs a cloud key

No telemetry, no account, no analytics. You can verify that yourself: it's all on GitHub under MIT/Apache-2.0.

I'll be in the comments, happy to answer anything, and bug reports are genuinely welcome (a macOS user reported a model-picker bug this morning and the fix is already in this build).
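
For anyone wondering what "shell with env blocklist + output caps" can mean in practice, here is a minimal Python sketch of the general idea. It is not Feral's code (Feral is Tauri 2 + Rust). The blocklist fragments, the 4096-byte cap, and the names `scrub_env`, `cap_output`, and `run_tool` are all assumptions made up for illustration.

```python
import os
import subprocess
import sys

# Hypothetical blocklist: name fragments that usually indicate secrets.
ENV_BLOCKLIST = ("KEY", "TOKEN", "SECRET", "PASSWORD")
# Hypothetical cap on how many bytes of output are handed back to the agent.
MAX_OUTPUT_BYTES = 4096


def scrub_env(env):
    """Return a copy of env without variables whose names match the blocklist."""
    return {
        name: value
        for name, value in env.items()
        if not any(fragment in name.upper() for fragment in ENV_BLOCKLIST)
    }


def cap_output(data, limit=MAX_OUTPUT_BYTES):
    """Truncate bytes to `limit` and append a note saying how much was dropped."""
    if len(data) <= limit:
        return data
    dropped = len(data) - limit
    return data[:limit] + f"\n[truncated {dropped} bytes]".encode()


def run_tool(argv, env=None, timeout=10):
    """Run a command with a scrubbed environment and capped stdout/stderr.

    Note: subprocess.run buffers the full output before it is capped here;
    a stricter version would read from a pipe and stop at the limit.
    """
    base_env = dict(os.environ) if env is None else dict(env)
    result = subprocess.run(
        argv,
        env=scrub_env(base_env),
        capture_output=True,
        timeout=timeout,
        check=False,
    )
    return result.returncode, cap_output(result.stdout), cap_output(result.stderr)


if __name__ == "__main__":
    # The child process should not be able to see the fake API key.
    fake_env = {**os.environ, "OPENAI_API_KEY": "sk-test"}
    child = "import os; print(os.environ.get('OPENAI_API_KEY'))"
    code, out, err = run_tool([sys.executable, "-c", child], env=fake_env)
    print(code, out.decode().strip())
```

The point of the two helpers is that a tool spawned by the agent never inherits secrets from the parent process, and a runaway command cannot flood the model's context with output.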