@Teknium: Give our early preview of Computer Use (with ANY model) a try today! Built into the latest Hermes Agent and powered by …
Summary
Teknium introduces an early preview of Computer Use built into the Hermes Agent and powered by TryCua, enabling any AI model to interact with and control a desktop environment in the background without overriding direct user input.
Similar Articles
@NousResearch: Computer use with any model Hermes Agent × @trycua
NousResearch announces that Hermes Agent now supports computer use tasks with any model through integration with the trycua framework.
@intheworldofai: Hermes Agent is evolving FAST. In just the past week, Nous Research added: - A full WebUI/Desktop App - Background Comp…
Nous Research releases a major update to the open-source Hermes Agent, adding native macOS background computer use, multi-agent orchestration via Kanban, and Lightpanda browser integration.
@VincentLogic: Found a pretty interesting AI assistant client! Hermes Agent, a clean-looking Chinese desktop app. Feature integration is quite comprehensive: - Conversation & Session Management - Multi-model Support - Skill & Tool Integration - Scheduled Tasks & Gateway Configuration From the interface, it can help you: Search web, set reminders, summarize emails,…
Hermes Agent is a cross-platform AI assistant desktop client built on Electron. It supports multi-model switching, skill integration, scheduled tasks, and more, aiming to give users a unified AI productivity workspace.
Computer-Using Agent
OpenAI introduced the Computer-Using Agent (CUA), a model that combines GPT-4o's vision with reinforcement learning to interact with GUIs as a human would, and that powers the new Operator agent. CUA sets new state-of-the-art benchmark results, including 38.1% on OSWorld and 58.1% on WebArena, and is available as a research preview for ChatGPT Pro users in the US.
Introducing the Gemini 2.5 Computer Use model
Google releases the Gemini 2.5 Computer Use model via the Gemini API, enabling developers to build AI agents that interact with user interfaces by clicking, typing, and scrolling. The model outperforms alternatives on web and mobile control benchmarks with lower latency and is available in preview on Google AI Studio and Vertex AI.