Tag
A guide and Python script for controlling an Android emulator using Gemini 3.5 Flash's Computer Use capability, which allows the model to view screenshots and return actions (taps, clicks, text input) to be executed via ADB.
OpenGUI is an open-source AI phone control system that lets AI autonomously operate real Android devices to carry out long-running mobile tasks such as social media management and research. It supports remote task dispatching via Lark, Telegram, Discord, or REST API. Its underlying architecture is split into two layers — a Plan Supervisor and an Executor Graph — and supports multiple models including Claude, Qwen, and Doubao.