computer-use-agents

Tag

Cards List
#computer-use-agents

ReVision: Scaling Computer-Use Agents via Temporal Visual Redundancy Reduction

arXiv cs.CL · 16h ago Cached

This paper introduces ReVision, a method to reduce token usage in computer-use agents by removing redundant visual patches from consecutive screenshots. It demonstrates that this efficiency gain allows agents to process longer trajectories and improve performance on benchmarks like OSWorld.

0 favorites 0 likes
#computer-use-agents

ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents

Hugging Face Daily Papers · yesterday Cached

ToolCUA is a new agent framework that optimizes GUI-tool path selection for computer use agents through staged training and reinforcement learning. It achieves state-of-the-art performance on OSWorld-MCP by effectively interleaving GUI actions and high-level tool calls.

0 favorites 0 likes
#computer-use-agents

Securing Computer-Use Agents: A Unified Architecture-Lifecycle Framework for Deployment-Grounded Reliability

arXiv cs.CL · 2d ago Cached

This academic paper proposes a unified architecture-lifecycle framework for securing computer-use agents (CUAs) as they transition from benchmarks to real-world software environments. It analyzes reliability challenges across perception, decision, execution layers and creation, deployment, operation, maintenance stages.

0 favorites 0 likes
#computer-use-agents

On the Reliability of Computer Use Agents

Hugging Face Daily Papers · 2026-04-20 Cached

A preprint analyzing why computer-use agents succeed once but fail on repeated executions, attributing unreliability to execution stochasticity, task ambiguity, and behavioral variability, and advocating repeated evaluation and stable strategies.

0 favorites 0 likes
#computer-use-agents

Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents

Papers with Code Trending · 2025-04-01 Cached

Agent S2 is a new compositional framework for computer use agents that achieves state-of-the-art performance on multiple benchmarks by utilizing Mixture-of-Grounding and Proactive Hierarchical Planning.

0 favorites 0 likes
#computer-use-agents

trycua/cua

GitHub Trending (daily) · 8h ago Cached

trycua/cua is an open-source toolkit and Python library for building, benchmarking, and deploying computer-use agents, featuring macOS background automation and cross-platform agent-ready sandboxes.

0 favorites 0 likes
← Back to home

Submit Feedback