Tag
Wayfinder Router is an open-source Python tool that deterministically routes prompts to local or hosted LLMs based on structural complexity, without calling any model, enabling offline cost savings.
A developer created an offline, single-file GPU build picker that estimates which local AI models a system can run and at what token generation speed.
Describes a medical speech-to-text system that runs locally on a MacBook, enabling streaming transcription without cloud dependency.
Google Gemma open models can now be used to deploy local coding agents directly on a laptop, enabling offline execution and faster development workflows.
Mutter AI Dictation is a private AI dictation tool that operates offline.
A user demonstrates giving a local LLM agent MCP tools for local image and video generation, enabling fully offline and free generation on demand.
The article introduces Xenova's open-sourcing of 27 custom WebGPU kernels, enabling Gemma 4 to run fully offline and locally in the browser at 255 tok/s, and discusses advantages like privacy and offline use. It also mentions FLUX.2's 3D generation capability.
Clawd is a context-aware browser mascot powered by 100% local offline AI.
A user built a fully offline and open-source RAG system on their laptop, emphasizing no data sent to OpenAI. They provide a 6-step guide.
OpenAI Codex is now free, but even more surprising is that local open-source models can achieve AI Agent performance close to the cloud, demonstrating scenarios like fixing games and developing web games without requiring API keys or internet.
A developer built a pocket Charles Spurgeon AI assistant that runs fully offline on a fine-tuned Gemma model. It can answer theological questions, prepare sermons, and grade sermon drafts in Spurgeon's voice.
Internal personality clashes at Anthropic reportedly caused their AI models to go offline.
Trace is a Mac app that transcribes meetings locally without uploading audio, allowing users to flag moments mid-call and get clean markdown transcripts.
Webxdc is a secure, peer-to-peer mini app format for chats that runs offline with zero tracking, enabling private games and collaboration without servers or app stores.
Reverie.fm is a private, offline location-based music journal app.
Atomic-Chat is an open-source desktop and mobile app for running LLMs locally, fully offline, providing a private alternative to ChatGPT.
Ultramemory is a private AI memory application for Mac that operates entirely locally with no cloud or account required.
A fully offline, CPU-only voice loop for local LLMs using Silero VAD, Parakeet STT, and Supertonic TTS, integrated via a one-command installer. Works with Ollama, LM Studio, and various agent frameworks.
Midas is a local agent memory tool that uses embeddings and ranking instead of LLM calls for ingest, achieving zero cost, offline operation, and high recall with auditable source turns.
Google's TurboVec is a new open-source tool that reduces memory usage from 31GB to 4GB for AI search data, leveraging TurboQuant for faster search than FAISS, and integrates with LangChain and LlamaIndex while running fully offline.