modality-credit-assignment

Tag

Cards List
#modality-credit-assignment

Bad Seeing or Bad Thinking? Rewarding Perception for Vision-Language Reasoning

arXiv cs.AI · 2026-05-15 Cached

This paper introduces a reinforcement learning framework that improves perception-reasoning synergy in vision-language models by explicitly rewarding perceptual fidelity, using a 'blindfolded reasoning' proxy and structured verbal verification to address ambiguity in modality credit assignment.

0 favorites 0 likes
← Back to home

Submit Feedback