chest-x-ray

#chest-x-ray

SDR: Set-Distance Rewards for Radiology Report Generation

arXiv cs.AI ↗ · 2d ago Cached

This paper proposes set-distance rewards for reinforcement learning in chest X-ray report generation, using embedding-based set-to-set distances between generated and reference reports. Post-training with these rewards via GRPO consistently outperforms supervised fine-tuning and exact-match rewards, and enables efficient test-time scaling.

0 favorites 0 likes

chest-x-ray

SDR: Set-Distance Rewards for Radiology Report Generation

Submit Feedback