multi-hop-tasks

#multi-hop-tasks

GTA: Generating Long-Horizon Tasks for Web Agents at Scale

arXiv cs.AI ↗ · 2026-05-29 Cached

This paper introduces GTA, a scalable framework for automatically generating long-horizon, multi-hop web agent tasks with executable trajectories, addressing the lack of process-level supervision in web agent benchmarks. The framework integrates crawling, retrieval-based seeding, and automated quality control to produce realistic tasks across multiple websites.

0 favorites 0 likes

multi-hop-tasks

GTA: Generating Long-Horizon Tasks for Web Agents at Scale

Submit Feedback