mobile-ai

#mobile-ai

Gemma 12b less than 10 watts 6.5pp 1.3tg

Reddit r/LocalLLaMA ↗ · 2026-06-14

Running the Gemma 12B model on a Google Pixel 10 Pro with llama.cpp achieves 6.5 tokens per second for prompt processing and 1.3 tokens per second for generation while drawing under 10 watts, showing that on-device inference at this model size is feasible within a phone's power budget.
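The reported figures can be turned into a rough energy-per-token estimate. The sketch below is only arithmetic on the numbers in the summary. It assumes the 10 watt figure is an upper bound that applies equally to prompt processing and generation, which the post does not state; the function and constant names are illustrative.

```python
def joules_per_token(power_watts: float, tokens_per_second: float) -> float:
    """Energy per token in joules (1 watt = 1 joule per second)."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return power_watts / tokens_per_second


# Assumption: "under 10 watts" is treated as a ceiling for both phases.
POWER_CEILING_W = 10.0
PROMPT_TPS = 6.5       # prompt processing rate from the summary
GENERATION_TPS = 1.3   # generation rate from the summary

prompt_j = joules_per_token(POWER_CEILING_W, PROMPT_TPS)
generation_j = joules_per_token(POWER_CEILING_W, GENERATION_TPS)

print(f"prompt processing: at most {prompt_j:.2f} J/token")
print(f"generation:        at most {generation_j:.2f} J/token")
```

Under that assumption, generation costs at most about 7.7 joules per token and prompt processing at most about 1.5 joules per token. Actual draw below the ceiling would lower both figures.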


Energy-Efficient On-Device RAG on a Mobile NPU: System Design and Benchmark on Snapdragon X Elite

arXiv cs.CL ↗ · 2026-06-11

This paper presents the first end-to-end RAG pipeline running entirely on a mobile NPU (Qualcomm Hexagon on Snapdragon X Elite), achieving up to 18x faster LLM prefilling and 4x lower energy vs. CPU, with no quality regression.


Local iPhone AI image generation is getting practical - only 3 seconds per image

Reddit r/ArtificialInteligence ↗ · 2026-06-03

Benchmark shows local Stable Diffusion 1.5 on iPhone can generate 512x512 images in as little as 3.1 seconds using optimized models like Realistic Vision V5.1 Hyper, making on-device AI image generation practical.


Ready or Not, the AI Phones Are Coming

Reddit r/artificial ↗ · 2026-06-01

This article discusses the imminent arrival of AI-powered smartphones and the implications for consumers and the tech industry.


The question with Gemini on Android is not just privacy. It is the action boundary.

Reddit r/AI_Agents ↗ · 2026-05-24

This article argues that the real issue with integrating Gemini deeper into Android isn't just privacy, but the action boundary—what the AI can read, suggest, draft, change, send, buy, or delete—and proposes a tiered consent model for different levels of AI agency.
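A tiered consent model of this kind could be sketched as follows. The action names come from the summary; the grouping into four tiers, the tier names, and the rule that unknown actions are treated as the most restricted tier are all assumptions made for illustration, not the article's actual proposal.

```python
from enum import IntEnum


class Tier(IntEnum):
    """Hypothetical consent tiers, ordered from least to most agency."""
    OBSERVE = 1       # the assistant only looks or proposes
    PREPARE = 2       # the assistant produces something the user still sends
    MODIFY = 3        # the assistant changes state or communicates for the user
    IRREVERSIBLE = 4  # the assistant spends money or destroys data


# Assumed mapping of the actions listed in the summary to tiers.
ACTION_TIER = {
    "read": Tier.OBSERVE,
    "suggest": Tier.OBSERVE,
    "draft": Tier.PREPARE,
    "change": Tier.MODIFY,
    "send": Tier.MODIFY,
    "buy": Tier.IRREVERSIBLE,
    "delete": Tier.IRREVERSIBLE,
}


def is_allowed(action: str, granted: Tier) -> bool:
    """Return True if the user's granted tier covers the requested action.

    Unknown actions fall back to the most restricted tier.
    """
    required = ACTION_TIER.get(action, Tier.IRREVERSIBLE)
    return required <= granted


print(is_allowed("draft", Tier.PREPARE))  # True
print(is_allowed("send", Tier.PREPARE))   # False
```

Ordering the tiers as integers keeps the check to a single comparison, and defaulting unknown actions to the strictest tier means a newly added capability is denied until someone classifies it.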


Vibe coding is coming to your phone

The Verge ↗ · 2026-05-20

Google and Apple are bringing AI-powered 'vibe coding' to mobile, letting users create custom apps, widgets, and shortcuts from natural-language prompts. Google demonstrated the Android version at Google I/O 2026, and a similar capability has been reported for iOS.


Google AI Edge Gallery v1.0.13 & v1.0.14 updates: Gemma 4 Multi-Token Prediction, Pixel TPU support, experimental MCP, new skills, now saves chat history

Reddit r/LocalLLaMA ↗ · 2026-05-19

Google AI Edge Gallery v1.0.13 & v1.0.14 updates add support for Gemma 4 with multi-token prediction, Pixel TPU optimization, experimental MCP, new skills, and chat history saving, enhancing on-device generative AI capabilities.


MiniCPM-V 4.6

Product Hunt ↗ · 2026-05-12

MiniCPM-V 4.6 is an ultra-efficient 1.3B vision-language model optimized for mobile devices.


@AdinaYakup: MiniCPM V4.6 a 1B MLLM that actually runs on your phone, just released by @OpenBMB 1B - Apache2.0 Runs on iOS, Android,…

X AI KOLs Following ↗ · 2026-05-11

OpenBMB has released MiniCPM V4.6, a 1B-parameter multimodal large language model optimized for mobile devices under the Apache 2.0 license. It features mixed visual token compression and claims approximately 1.5x faster throughput than Qwen3.5 0.8B while running natively on iOS, Android, and HarmonyOS.


@billtheinvestor: One Phone to Disrupt the Entire 3D Virtual Tour Industry! Browser-based interactive 3D tours that used to cost six figures can now be done overnight — AI scanning tools are turning ordinary smartphones into full-featured 3D production studios

X AI KOLs Timeline ↗ · 2026-05-08

AI scanning tools are turning ordinary smartphones into full-featured 3D production studios, enabling browser-based interactive 3D virtual tours that once required six-figure budgets to be completed quickly with just a phone.


@QingQ77: Let AI automatically control a real Android phone to perform long-running mobile tasks like social media, research, and content operations https://github.com/Core-Mate/OpenGUI… OpenGUI is an AI phone control system where AI operates directly on your Androi…

X AI KOLs Timeline ↗ · 2026-05-08

OpenGUI is an open-source AI phone control system that lets AI autonomously operate real Android devices to carry out long-running mobile tasks such as social media management and research. It supports remote task dispatching via Lark, Telegram, Discord, or a REST API. Its architecture is split into two layers, a Plan Supervisor and an Executor Graph, and it supports multiple models including Claude, Qwen, and Doubao.


ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Papers with Code Trending ↗ · 2026-04-13

ClawGUI is an open-source framework for training, evaluating, and deploying GUI agents using reinforcement learning, featuring standardized benchmarks and cross-platform deployment to Android, iOS, and HarmonyOS.


Announcing Gemma 3n preview: Powerful, efficient, mobile-first AI

Google DeepMind Blog ↗ · 2025-05-20

Google announces Gemma 3n preview, a mobile-first open AI model optimized for on-device inference on phones, tablets, and laptops. Built on a new architecture developed with hardware partners like Qualcomm and MediaTek, Gemma 3n uses innovations like Per-Layer Embeddings to achieve fast performance with minimal memory footprint (2-3GB), while supporting multimodal capabilities.


Search What You See: The Tech Behind The Magic | Made by Google Podcast S9E4

YouTube AI Channels ↗ · 2026-05-08

Google has enhanced Circle to Search with Gemini 3 so that it recognizes the whole scene on screen rather than a single object. The update focuses on breaking fashion ensembles down into individual items and supporting virtual try-ons, so users can find alternative products and preview how they look without taking screenshots.
