gui-grounding

#gui-grounding

Trust the Right Teacher: Quality-Aware Self-Distillation for GUI Grounding

Hugging Face Daily Papers ↗ · 5d ago Cached

Proposes quality-aware self-distillation for GUI grounding, improving coordinate-token teacher signals via correctness-aware gating and probability scaling to enhance vision-language model performance.

0 favorites 0 likes

#gui-grounding

VISTA: View-Consistent Self-Verified Training for GUI Grounding

Hugging Face Daily Papers ↗ · 2026-06-12 Cached

VISTA introduces a view-consistent self-verified training method for GUI grounding that improves GRPO-based coordinate prediction by using multiple target-preserving views, achieving consistent accuracy gains across benchmarks.

0 favorites 0 likes

#gui-grounding

DRS-GUI: Dynamic Region Search for Training-Free GUI Grounding

arXiv cs.AI ↗ · 2026-05-18 Cached

DRS-GUI proposes a training-free dynamic region search framework for GUI grounding, using a lightweight UI Perceptor with human-like perceptual actions and Monte Carlo Tree Search to progressively locate instruction-relevant elements. Experiments show a 14% improvement on ScreenSpot-Pro for both general and GUI-specific MLLMs.

0 favorites 0 likes

#gui-grounding

@HuggingPapers: Microsoft just released Phi-Ground-Any on Hugging Face A 4B parameter vision model for GUI grounding that achieves SOTA…

X AI KOLs Following ↗ · 2026-05-09 Cached

Microsoft has released Phi-Ground-Any, a 4B parameter vision model for GUI grounding on Hugging Face that achieves state-of-the-art results, enabling AI agents to precisely interact with screen elements.

0 favorites 0 likes

gui-grounding

Trust the Right Teacher: Quality-Aware Self-Distillation for GUI Grounding

VISTA: View-Consistent Self-Verified Training for GUI Grounding

DRS-GUI: Dynamic Region Search for Training-Free GUI Grounding

@HuggingPapers: Microsoft just released Phi-Ground-Any on Hugging Face A 4B parameter vision model for GUI grounding that achieves SOTA…

Submit Feedback