role-specific-training

Tag

Cards List
#role-specific-training

Visual Para-Thinker++: A Single-Policy Multi-Agent Framework for Visual Reasoning

Hugging Face Daily Papers · 6d ago Cached

Visual Para-Thinker++ proposes a single-policy multi-agent framework for visual reasoning that uses role-conditioned agents (Main, Worker, Summary) and dedicated training methods to reduce hallucinations and improve efficiency, outperforming baselines on hallucination-sensitive benchmarks.

0 favorites 0 likes
← Back to home

Submit Feedback