Tag
Researchers introduce a new multimodal benchmark derived from Japan's National Assessment of Academic Ability, featuring 900K aggregated student responses to evaluate MLLM performance in authentic K-12 educational contexts.
This paper introduces Checkup2Action, a multimodal dataset and benchmark for generating patient-oriented action cards from clinical check-up reports, addressing the interpretability gap for laypersons.
PianoCoRe is a large-scale piano MIDI dataset unifying and refining open-source corpora with 250,046 performances of 5,625 pieces by 483 composers, featuring note-level alignments for music information retrieval and including a MIDI quality classifier and alignment refinement pipeline.