multi-course


ClinicalMC: A Benchmark for Multi-Course Clinical Decision-Making with Large Language Models

arXiv cs.AI · 19h ago

ClinicalMC is a benchmark for evaluating large language models on multi-course clinical decision-making. It provides datasets in Chinese and English and a multi-agent evaluation framework.
