long-term-history

Tag

Cards List
#long-term-history

Synthesis and Evaluation of Long-term History-aware Medical Dialogue

arXiv cs.CL · 2026-05-20 Cached

This paper introduces a framework for synthesizing long-term medical dialogue datasets using LLMs, and creates MediLongChat with three benchmark tasks to evaluate healthcare agents' memory and reasoning capabilities. Experiments show that even state-of-the-art LLMs struggle with these tasks.

0 favorites 0 likes
← Back to home

Submit Feedback