@Teknium: Give our early preview of Computer Use (with ANY model) a try today! Built into the latest Hermes Agent and powered by …

X AI KOLs Following Tools

Summary

Teknium introduces an early preview of Computer Use built into the Hermes Agent and powered by TryCua, enabling any AI model to interact with and control a desktop environment in the background without overriding direct user input.

Give our early preview of Computer Use (with ANY model) a try today! Built into the latest Hermes Agent and powered by @trycua - opens the door to any model, not just the frontier models in special modes - to control your actual computer. Best part, it doesnt take over your PC - you can continue to work and operate with full control of your keyboard, mouse, and screen - works entirely in the background!
Original Article

Similar Articles

@VincentLogic: Found a pretty interesting AI assistant client! Hermes Agent, a clean-looking Chinese desktop app. Feature integration is quite comprehensive: - Conversation & Session Management - Multi-model Support - Skill & Tool Integration - Scheduled Tasks & Gateway Configuration From the interface, it can help you: Search web, set reminders, summarize emails,…

X AI KOLs Timeline

Hermes Agent is a cross-platform AI assistant desktop client developed based on Electron. It supports multi-model switching, skill integration, scheduled tasks, and more, aiming to provide users with a unified AI productivity workspace.

Computer-Using Agent

OpenAI Blog

OpenAI introduced the Computer-Using Agent (CUA), a model combining GPT-4o's vision with reinforcement learning to interact with GUIs like a human, powering the new Operator agent. CUA sets new state-of-the-art benchmarks including 38.1% on OSWorld and 58.1% on WebArena, and is available as a research preview for ChatGPT Pro users in the US.

Introducing the Gemini 2.5 Computer Use model

Google DeepMind Blog

Google releases Gemini 2.5 Computer Use model via the Gemini API, enabling developers to build AI agents that can interact with user interfaces through clicking, typing, and scrolling. The model outperforms alternatives on web and mobile control benchmarks with lower latency and is available in preview on Google AI Studio and Vertex AI.