I built agent-browser but for OS automation.

Reddit r/AI_Agents Tools

Summary

The author introduces agent-ctrl, an open-source Rust-based CLI tool for OS automation that allows AI agents to interact with native application UIs via accessibility trees.

Hey r/AI_Agents ! I was using agent-browser to power my agentic workflow, and it worked great. When I wanted to expand computer-use to the OS itself, I couldn't find a good enough tool that was open-source, so I decided to build it myself. **What is agent-ctrl?** agent-ctrl is an OS automation CLI for AI agents written in Rust for speed. **How does it work?** agent-ctrl turns native app UIs into agent-readable format, then letting you or your agent act upon UIs. It flattens and parses accessibility trees from any OS into one schema, which allows for cross-OS agents. For now it supports Windows, I'm working on MacOS and Linux right now. Looking for people open to contribute for Linux, since I do not run it myself.
Original Article

Similar Articles

We turned Cursor.ai into an OpenClaw-style multi-agent control panel

Reddit r/AI_Agents

Developers built an open-source web UI on top of the Cursor CLI that turns it into a multi-agent control panel, allowing users to run multiple Cursor agent sessions with separate workspaces, scheduling, and MCP config management from a browser-based cockpit.

bytedance/UI-TARS-desktop

GitHub Trending (daily)

ByteDance released TARS, a multimodal AI agent stack comprising Agent TARS (a CLI/Web UI-based general AI agent for GUI, browser, and terminal tasks) and UI-TARS Desktop (a native desktop application powered by the UI-TARS model for local and remote computer/browser automation). The stack integrates multimodal LLMs with MCP tools for human-like task completion.