offline

#offline

Wayfinder Router: deterministic routing of queries between local and hosted LLM

Hacker News Top ↗ · 14h ago Cached

Wayfinder Router is an open-source Python tool that deterministically routes prompts to local or hosted LLMs based on structural complexity, without calling any model, enabling offline cost savings.

0 favorites 0 likes

#offline

I made an offline, single-file GPU build picker that estimates what local models a rig will run — and at what tok/s

Reddit r/LocalLLaMA ↗ · yesterday

A developer created an offline, single-file GPU build picker that estimates which local AI models a system can run and at what token generation speed.

0 favorites 0 likes

#offline

Streaming medical STT running locally on a MacBook

Reddit r/LocalLLaMA ↗ · 2d ago

Describes a medical speech-to-text system that runs locally on a MacBook, enabling streaming transcription without cloud dependency.

0 favorites 0 likes

#offline

@googledevs: Deploy local coding agents directly on your laptop with Google Gemma open models → https://goo.gle/gemma-ama-en Join Ia…

X AI KOLs Following ↗ · 2026-06-19 Cached

Google Gemma open models can now be used to deploy local coding agents directly on a laptop, enabling offline execution and faster development workflows.

0 favorites 0 likes

#offline

Mutter AI Dictation

Product Hunt ↗ · 2026-06-19

Mutter AI Dictation is a private AI dictation tool that operates offline.

0 favorites 0 likes

#offline

gave my local llm agent mcp tools for local image + video gen, so it just generates when i ask (fully offline+free)

Reddit r/LocalLLaMA ↗ · 2026-06-18

A user demonstrates giving a local LLM agent MCP tools for local image and video generation, enabling fully offline and free generation on demand.

0 favorites 0 likes

#offline

@hank_aibtc: Amazing! Running Gemma 4 in the browser, on par with ChatGPT?! Completely zero server, zero data upload, offline, pure WebGPU local inference! Xenova has open-sourced all 27 custom WebGPU kernels written by Fable 5: - Gemma 4 E2B (2.3B parameters...)

X AI KOLs Timeline ↗ · 2026-06-18 Cached

The article introduces Xenova's open-sourcing of 27 custom WebGPU kernels, enabling Gemma 4 to run fully offline and locally in the browser at 255 tok/s, and discusses advantages like privacy and offline use. It also mentions FLUX.2's 3D generation capability.

0 favorites 0 likes

#offline

Clawd

Product Hunt ↗ · 2026-06-17

Clawd is a context-aware browser mascot powered by 100% local offline AI.

0 favorites 0 likes

#offline

@hasantoxr: I built a RAG system on my own laptop that never sends a single byte to OpenAI. 100% offline. 100% open source. Here's …

X AI KOLs Timeline ↗ · 2026-06-16 Cached

A user built a fully offline and open-source RAG system on their laptop, emphasizing no data sent to OpenAI. They provide a 6-step guide.

0 favorites 0 likes

#offline

@php_martin: OpenAI Codex is now free. But what really shocked me isn't the free part — it's that local open-source models can deliver AI Agent performance close to the cloud experience. The video demonstrates 4 real-world scenarios: fixing a crashed space game, building a Whac-A-Mole web game in minutes, generating an Apple-style product homepage, and even launching a browser to search, download, and save files on its own.

X AI KOLs Timeline ↗ · 2026-06-16 Cached

OpenAI Codex is now free, but even more surprising is that local open-source models can achieve AI Agent performance close to the cloud, demonstrating scenarios like fixing games and developing web games without requiring API keys or internet.

0 favorites 0 likes

#offline

@no_stp_on_snek: I built a pocket Charles Spurgeon. Ask the Prince of Preachers for counsel or hand him your own sermon draft and let hi…

X AI KOLs Following ↗ · 2026-06-15 Cached

A developer built a pocket Charles Spurgeon AI assistant that runs fully offline on a fine-tuned Gemma model. It can answer theological questions, prepare sermons, and grade sermon drafts in Spurgeon's voice.

0 favorites 0 likes

#offline

"They screwed us": Personality clashes sent Anthropic's models offline

Reddit r/singularity ↗ · 2026-06-15

Internal personality clashes at Anthropic reportedly caused their AI models to go offline.

0 favorites 0 likes

#offline

Show HN: Trace – Offline Mac meeting transcripts you can flag mid-call

Hacker News Top ↗ · 2026-06-13 Cached

Trace is a Mac app that transcribes meetings locally without uploading audio, allowing users to flag moments mid-call and get clean markdown transcripts.

0 favorites 0 likes

#offline

Webxdc - Secure mini apps for chats

Lobsters Hottest ↗ · 2026-06-13 Cached

Webxdc is a secure, peer-to-peer mini app format for chats that runs offline with zero tracking, enabling private games and collaboration without servers or app stores.

0 favorites 0 likes

#offline

Reverie.fm

Product Hunt ↗ · 2026-06-13

Reverie.fm is a private, offline location-based music journal app.

0 favorites 0 likes

#offline

@rohanpaul_ai: Github of Atomic-Chat. "an open source alternative to ChatGPT that runs 100% offline on your computer." https://github.…

X AI KOLs Following ↗ · 2026-06-12 Cached

Atomic-Chat is an open-source desktop and mobile app for running LLMs locally, fully offline, providing a private alternative to ChatGPT.

0 favorites 0 likes

#offline

Ultramemory

Product Hunt ↗ · 2026-06-11

Ultramemory is a private AI memory application for Mac that operates entirely locally with no cloud or account required.

0 favorites 0 likes

#offline

I wired a fully offline voice loop to Ollama + LM Studio — 100% CPU, no GPU, nothing leaves your machine (Silero VAD + Parakeet STT + Supertonic TTS 3)

Reddit r/LocalLLaMA ↗ · 2026-06-11

A fully offline, CPU-only voice loop for local LLMs using Silero VAD, Parakeet STT, and Supertonic TTS, integrated via a one-command installer. Works with Ollama, LM Studio, and various agent frameworks.

0 favorites 0 likes

#offline

Midas: 100% local agent memory — no LLM at ingest, $0, nothing leaves the box (MCP + Python SDK)

Reddit r/AI_Agents ↗ · 2026-06-07

Midas is a local agent memory tool that uses embeddings and ranking instead of LLM calls for ingest, achieving zero cost, offline operation, and high recall with auditable source turns.

0 favorites 0 likes

#offline

@dr_cintas: Google's new algorithm just shrunk 31GB of memory down to 4GB TurboVec is a new open-source tool that stores the data y…

X AI KOLs Timeline ↗ · 2026-06-05 Cached

Google's TurboVec is a new open-source tool that reduces memory usage from 31GB to 4GB for AI search data, leveraging TurboQuant for faster search than FAISS, and integrates with LangChain and LlamaIndex while running fully offline.

0 favorites 0 likes

offline

Submit Feedback