mobile-workflows

PhoneHarness: Harnessing Phone-Use Agents through Mixed GUI, CLI, and Tool Actions

Hugging Face Daily Papers · 2026-06-12

PhoneHarness is a mixed-action benchmark and execution framework for evaluating phone-use agents on verifiable mobile workflows. Using deterministic action routing and auditable execution traces, it reports a 75% pass rate, 12.9 percentage points above existing approaches.
