process-oriented

Tag

Cards List
#process-oriented

MedBench v5: A Dynamic, Process-Oriented, and Hallucination-Aware Benchmark for Clinical Multimodal Models

arXiv cs.CL · 14h ago Cached

MedBench v5 is a dynamic, process-oriented benchmark for clinical multimodal models that integrates hallucination detection and stress testing, moving beyond static QA to evaluate reasoning and stability under information-flow stressors.

0 favorites 0 likes
← Back to home

Submit Feedback