Tag
Fara1.5 is a family of native computer use agents trained using the FaraGen1.5 scalable data pipeline. The models achieve new state-of-the-art results on browser-use benchmarks, competing with much larger frontier models.
Microsoft launches Fara-7B, an efficient Computer Use Agent with only 7B parameters, surpassing larger models on web tasks, supporting pure local deployment, and achieving low-cost desktop automation.
H Company releases Holo3.1, a family of Vision-Language Models (0.8B to 35B) for computer use agents, supporting web, desktop, and mobile automation with native function calling and optimized quantized checkpoints for local deployment.
Microsoft released Fara-7B, a 7-billion parameter small language model that can autonomously control a computer to perform tasks like clicking, scrolling, and filling forms, running on-device and beating larger models like OpenAI's computer-use agent on benchmarks.
Asteroid is a computer use agent builder that works across Browser, Linux, and Windows, enabling users to create automated agents.
H Company releases Holotron-12B, a multimodal computer-use agent optimized for high-throughput inference using a hybrid SSM architecture. The model, post-trained on NVIDIA Nemotron, demonstrates superior efficiency and scalability for interactive agentic workloads.
Microsoft released Fara-7B, an efficient 7 billion parameter agentic small language model (SLM) for computer use tasks, achieving state-of-the-art performance within its size class and competitive with larger systems.
OpenAI released the Operator System Card detailing safety evaluations for its Computer-Using Agent (CUA) model, which combines GPT-4o's vision capabilities with reinforcement learning to interact with GUIs and perform web-based tasks on users' behalf. The card outlines risk areas including prompt injections, harmful tasks, and model mistakes, along with multi-layered mitigations based on OpenAI's Preparedness Framework.