Tag
This paper introduces ClinicalBench and the EpiKG system, evaluating assertion-aware retrieval for clinical question answering on MIMIC-IV data across multiple LLMs. It demonstrates that handling negation and temporality in retrieval significantly improves performance over standard baselines.
This study evaluates how interactive dialogue with an LLM (via the MedSyn system) improves diagnostic accuracy for physicians in emergency care settings, showing significant gains for residents on difficult cases.