on-device

#on-device

@laobaishare: This is incredible. Google just dropped a free AI voice dictation app, supporting iOS and Mac. All paid features unlocked, no subscription needed. 100% free, fully local, powered by Gemma 4. Download here: https://ai.google.dev/edg…

X AI KOLs Timeline ↗ · 2026-06-03 Cached

Google launched a free AI voice dictation app, powered by Gemma 4, supporting iOS and Mac, fully local, no subscription needed.

#on-device

Microsoft Aion 1.0 Instruct and Aion 1.0 Plan models!

Reddit r/LocalLLaMA ↗ · 2026-06-03

Microsoft announced two new on-device AI models at Build 2026: Aion 1.0 Instruct, an open-weights small language model, and Aion 1.0 Plan, a 14B parameter reasoning and tool-calling model for local agentic workflows.

#on-device

Show HN: Live breath detection and biofeedback from a phone microphone

Hacker News Top ↗ · 2026-06-02 Cached

An open-source project that uses a phone microphone for live breath detection and biofeedback, processing audio on-device to enhance self-awareness without wearables or cloud uploads.

#on-device

Help Wanted: Opinions

Reddit r/artificial ↗ · 2026-06-01

A solo developer is building Scout, an AI companion that runs entirely on-device with no cloud connection or account, and is seeking feedback before the beta release.

#on-device

NVIDIA Levels Up Local AI Agents Across RTX PCs and DGX Spark

NVIDIA Blog ↗ · 2026-06-01 Cached

NVIDIA announced RTX Spark PCs and a wave of updates to enable local AI agents across RTX and DGX ecosystems, including the OpenShell runtime coming to Windows, NemoClaw expansion, performance improvements, and integrations with Adobe and H Company.

#on-device

MiniCPM5-1B Shows Why the Small-Model Race Isn't Over

Reddit r/ArtificialInteligence ↗ · 2026-05-31 Cached

MiniCPM5-1B is a 1B parameter model from OpenBMB that achieves impressive scores on AIME 2025 and τ2-Bench Telecom, outperforming larger models. It features both fast and reasoning modes from a single checkpoint, enabled by a three-stage post-training process including supervised fine-tuning, reinforcement learning, and on-policy distillation.

#on-device

google/magenta-realtime-2

Hugging Face Models Trending ↗ · 2026-05-28 Cached

Google DeepMind released Magenta RealTime 2, an open music generation model for on-device streaming with low-latency control via text, audio examples, and MIDI.

#on-device

UI-KOBE: Knowledge-Oriented Behavior Exploration for Lightweight Graph-Guided GUI Agents

Hugging Face Daily Papers ↗ · 2026-05-28 Cached

UI-KOBE proposes a framework that enhances lightweight mobile GUI agents by constructing and leveraging app-specific knowledge graphs to improve task planning and execution efficiency.

#on-device

Signs Beat Floats: Low-Rank Double-Binary Adaptation for On-Device Fine-Tuning

arXiv cs.LG ↗ · 2026-05-26 Cached

LoRDBA replaces LoRA's floating-point low-rank factors with binary sign carriers and channel-wise scales, enabling efficient on-device fine-tuning with significant footprint reduction and minimal latency overhead, matching fp16 quality.

#on-device

MobileMoE: Scaling On-Device Mixture of Experts

Hugging Face Daily Papers ↗ · 2026-05-26 Cached

MobileMoE introduces efficient on-device mixture-of-experts language models with sub-billion parameters, achieving better performance and efficiency than dense baselines and existing MoE models. The models are trained on open-source datasets and demonstrate significant speedups on commodity smartphones.

#on-device

MiniCPM5-1B

Reddit r/LocalLLaMA ↗ · 2026-05-25 Cached

OpenBMB releases MiniCPM5-1B, a dense 1B Transformer model achieving SOTA among open-source 1B-class models, designed for on-device deployment with hybrid reasoning and long-context support.

#on-device

@heyshrutimishra: Full-sized AI models now run on phones. That's BitCPM, a new open-source model from ModelBest, Tsinghua, and OpenBMB. T…

X AI KOLs Following ↗ · 2026-05-25 Cached

BitCPM is a new open-source model from ModelBest, Tsinghua, and OpenBMB that uses ternary weights (-1,0,1) to run full-sized AI models on phones.

#on-device

@HuggingModels: Gemma 4 is here, and it's optimized for Apple Silicon. This 4-bit quantized model runs fast on your Mac, not just in th…

X AI KOLs Timeline ↗ · 2026-05-24 Cached

A 4-bit quantized build of Gemma 4 is optimized for Apple Silicon, enabling fast local inference on Macs and reducing reliance on cloud computing.

#on-device

@AlphaSignalAI: A 66M parameter model just beat ElevenLabs on a Raspberry Pi. Text-to-speech has lived in the cloud for years. Every sp…

X AI KOLs Timeline ↗ · 2026-05-22 Cached

Supertonic 3 is a 66M parameter open-source TTS model that runs entirely on-device, reportedly beating ElevenLabs on a Raspberry Pi and running 167x faster than real time on a laptop CPU.

#on-device

@googlegemma: We are entering a new era of on-device automation. Watch Gemma 4 E4B navigate and drive an iOS simulator directly using…

X AI KOLs Timeline ↗ · 2026-05-21 Cached

Google Gemma demonstrates Gemma 4 E4B autonomously navigating and driving an iOS simulator using Argent, showcasing on-device automation capabilities.

#on-device

I'm launching the fastest and most powerful local AI image generator for iPhone

Reddit r/ArtificialInteligence ↗ · 2026-05-21

PhoneDiffusion is a newly launched local AI image generator for iPhone, offering sub-5-second generation, privacy, and no account requirement.

#on-device

@FeitengLi: Hy-MT2 - a new open-source multilingual translation model that matches top-tier large models in capability, supports translation between 33 languages, and offers flexible instruction capabilities. It achieves 2-bit quantization under 500MB, making it well-suited for on-device deployment. https://modelsc…

X AI KOLs Timeline ↗ · 2026-05-21 Cached

Hy-MT2 is a new open-source multilingual translation model from Tencent Hy that supports translation between 33 languages, offers flexible instruction capabilities, and fits in under 500MB with 2-bit quantization, making it suitable for on-device deployment.

#on-device

@AdinaYakup: Hy-MT2 New translation model family from @TencentHunyuan 1.8B / 7B / 30B-A3B MoE Supports 33 languages 1.8B > 440MB wit…

X AI KOLs Following ↗ · 2026-05-21 Cached

Tencent Hunyuan released Hy-MT2, a family of translation models in 1.8B, 7B, and 30B-A3B MoE sizes, supporting 33 languages and quantized for on-device use.

#on-device

@_vmlops: MICROSOFT'S FARA-7B CAN USE YOUR COMPUTER FOR YOU 7b params...clicks, scrolls, fills forms, books tickets all on its ow…

X AI KOLs Timeline ↗ · 2026-05-18 Cached

Microsoft released Fara-7B, a 7-billion parameter small language model that can autonomously control a computer to perform tasks like clicking, scrolling, and filling forms, running on-device and beating larger models like OpenAI's computer-use agent on benchmarks.

#on-device

@rohanpaul_ai: So much possibilities for on-device small models. Here @adrgrondin is running Google’s Gemma 4 E2B on iPhone 17 Pro. ~4…

X AI KOLs Following ↗ · 2026-05-17 Cached

Google's Gemma 4 E2B is demonstrated running on an iPhone 17 Pro via MLX optimization, achieving ~40 tokens/second with 128K context and offline thinking mode for coding and math.
