mobile-agents

Tag

Cards List
#mobile-agents

MIRAGE: Mobile Agents with Implicit Reasoning and Generative World Models

arXiv cs.AI · 4d ago Cached

MIRAGE is a framework for mobile GUI agents that replaces verbose chain-of-thought reasoning with compact continuous latent representations, incorporating a generative world model perspective to predict future screen states before acting. On AndroidWorld and AndroidControl benchmarks, it achieves competitive or superior performance while reducing generated tokens by over 75%.

0 favorites 0 likes
#mobile-agents

Perceive Before Reasoning: A Pre-Reasoning Perception Framework for Efficient and Reliable Proactive Mobile Agents

arXiv cs.AI · 5d ago Cached

This paper proposes a Pre-Reasoning Perception Framework (PRPF) for proactive mobile agents, decoupling intervention timing from assistance generation to improve efficiency and reduce false triggers.

0 favorites 0 likes
#mobile-agents

Is state tracking the hardest part of phone-use AI?

Reddit r/AI_Agents · 2026-05-20

The author observes that the hardest part of phone-use AI agents is tracking state changes, as mobile interfaces have more dynamic and interruptive UI changes compared to desktop, and asks for others' experience.

0 favorites 0 likes
← Back to home

Submit Feedback