long-term-history

#long-term-history

Synthesis and Evaluation of Long-term History-aware Medical Dialogue

arXiv cs.CL ↗ · 2026-05-20 Cached

This paper introduces a framework for synthesizing long-term medical dialogue datasets using LLMs, and creates MediLongChat with three benchmark tasks to evaluate healthcare agents' memory and reasoning capabilities. Experiments show that even state-of-the-art LLMs struggle with these tasks.

0 favorites 0 likes

long-term-history

Synthesis and Evaluation of Long-term History-aware Medical Dialogue

Submit Feedback