UI-KOBE is a framework that enhances lightweight mobile GUI agents by constructing app-specific knowledge graphs and using them to improve task planning and execution efficiency.
MobileExplorer is a new framework that accelerates on-device inference for mobile GUI agents by performing lightweight parallel exploration of UI elements during model inference, reducing reasoning steps and latency by 23% while maintaining or improving task success rates.
SimuWoB is a synthetic benchmark of 120 challenging tasks for mobile GUI agents, built on high-fidelity virtual environments with automatic reward generation. Experiments show that current agents achieve an average success rate of only 27.92%, which drops to 17.82% on long-horizon tasks, indicating substantial weaknesses in complex scenarios.