computer-use

#computer-use

Giving AI a real phone feels more interesting than another browser agent

Reddit r/openclaw ↗ · 7h ago

OpenGUI is highlighted as a novel AI agent platform that utilizes actual Android devices for task execution, offering a more realistic interface than traditional browser-based agents.

0 favorites 0 likes

#computer-use

@axiaisacat: ByteDance has open-sourced an AI called UI-TARS that can directly control your computer. It is open-source, free, and runs locally. You tell it using natural language: 'Book me the earliest flight from San Francisco to New York on September 1st on Priceline', 'Set the auto-save delay in VS Code to 500ms', '...'

X AI KOLs Timeline ↗ · 14h ago

ByteDance has open-sourced UI-TARS, an AI model capable of directly controlling computer interfaces via mouse and keyboard for tasks like booking flights or configuring software. Available in 2B, 7B, and 72B parameter sizes, it runs locally and offers a free alternative to paid services like Anthropic's Computer Use.

0 favorites 0 likes

#computer-use

I built agent-browser but for OS automation.

Reddit r/AI_Agents ↗ · 22h ago

The author introduces agent-ctrl, an open-source Rust-based CLI tool for OS automation that allows AI agents to interact with native application UIs via accessibility trees.

0 favorites 0 likes

#computer-use

@QingQ77: Let AI automatically control a real Android phone to perform long-running mobile tasks like social media, research, and content operations https://github.com/Core-Mate/OpenGUI… OpenGUI is an AI phone control system where AI operates directly on your Androi…

X AI KOLs Timeline ↗ · 2d ago Cached

OpenGUI is an open-source AI phone control system that lets AI autonomously operate real Android devices to carry out long-running mobile tasks such as social media management and research. It supports remote task dispatching via Lark, Telegram, Discord, or REST API. Its underlying architecture is split into two layers — a Plan Supervisor and an Executor Graph — and supports multiple models including Claude, Qwen, and Doubao.

0 favorites 0 likes

#computer-use

@RoundtableSpace: A fully local desktop automation agent that sees your screen, controls your mouse and keyboard, and completes tasks in …

X AI KOLs Timeline ↗ · 3d ago Cached

Roundtable Space is a fully local, open-source desktop automation agent that uses natural language to control screens, mice, and keyboards across applications, rapidly accumulating over 29k GitHub stars.

0 favorites 0 likes

#computer-use

Ara

Product Hunt ↗ · 4d ago

Ara is a new agentic AI product that functions as a computer-use agent integrated into the user interface.

0 favorites 0 likes

#computer-use

@mamagnus00: The Google Slides UI is hostile to agents. Watch this agent's thoughts while modifying its own tools to handle the weir…

X AI KOLs Following ↗ · 2026-04-21 Cached

A developer demonstrates an AI agent autonomously modifying its own browser automation tools to handle edge cases in the Google Slides interface.

0 favorites 0 likes

#computer-use

@oshaikh13: very cool idea @OpenAI I’m really excited about this research preview- learning from how people interact with their com…

X AI KOLs Following ↗ · 2026-04-20

An OpenAI research preview explores learning from how people interact with their computers beyond chat, accompanied by a new arxiv paper on the topic.

0 favorites 0 likes

#computer-use

@injaneity: i reverse engineered @OpenAI's Codex Computer Use and built pi-computer-use: a model agnostic computer use tool for my …

X AI KOLs Timeline ↗ · 2026-04-20 Cached

A developer reverse-engineered OpenAI's Codex Computer Use to build pi-computer-use, an open-source, model-agnostic macOS automation tool featuring ax-first navigation and vision fallback for supported models.

0 favorites 0 likes

#computer-use

@sama: Lots of major improvements to Codex! Computer use is a real update for me; it feels even more useful than I expected. I…

X AI KOLs ↗ · 2026-04-16 Cached

Sam Altman announces major improvements to Codex, highlighting a new computer use capability that allows the model to control Mac applications in parallel without interfering with user workflows.

0 favorites 0 likes

#computer-use

Codex for (almost) everything

OpenAI Blog ↗ · 2026-04-16 Cached

OpenAI releases a major update to Codex, enabling it to operate computers via cursor control, generate images, manage long-term tasks with memory, and deeply integrate with developer workflows like SSH and PR reviews.

0 favorites 0 likes

#computer-use

Meet HoloTab by HCompany. Your AI browser companion.

Hugging Face Blog ↗ · 2026-04-15 Cached

HCompany has launched HoloTab, a Chrome extension powered by the Holo3 computer-use AI model, designed to automate web tasks and create reusable routines for users without technical skills.

0 favorites 0 likes

#computer-use

Introducing GPT-5.4

OpenAI Blog ↗ · 2026-03-05 Cached

OpenAI is releasing GPT-5.4 and GPT-5.4 Pro across ChatGPT, the API, and Codex, featuring native computer-use capabilities, 1M token context, improved reasoning and coding, and state-of-the-art performance on professional knowledge work benchmarks. It is described as OpenAI's most capable and token-efficient reasoning model to date.

0 favorites 0 likes

#computer-use

Introducing the Gemini 2.5 Computer Use model

Google DeepMind Blog ↗ · 2025-10-23 Cached

Google releases Gemini 2.5 Computer Use model via the Gemini API, enabling developers to build AI agents that can interact with user interfaces through clicking, typing, and scrolling. The model outperforms alternatives on web and mobile control benchmarks with lower latency and is available in preview on Google AI Studio and Vertex AI.

0 favorites 0 likes

#computer-use

Computer-Using Agent

OpenAI Blog ↗ · 2025-01-23 Cached

OpenAI introduced the Computer-Using Agent (CUA), a model combining GPT-4o's vision with reinforcement learning to interact with GUIs like a human, powering the new Operator agent. CUA sets new state-of-the-art benchmarks including 38.1% on OSWorld and 58.1% on WebArena, and is available as a research preview for ChatGPT Pro users in the US.

0 favorites 0 likes

#computer-use

AI News: Anthropic Went Crazy This Week!

YouTube AI Channels ↗ · 2026-04-22 Cached

Anthropic launched 74 updates in 52 days including Computer Use, Projects, and Claude Code Auto Mode, while Google countered with Gemini 3.1 Flash Live, vibe-coded browser demos, and Lyria 3 Pro music tools, as GenSpark enters with $20/month unlimited AI through 2026.

0 favorites 0 likes

computer-use

Submit Feedback