multi-party-conversation

#multi-party-conversation

Evaluating Large Language Models Abilities for Addressee, Turn-change, and Next Speaker Prediction in Meetings

arXiv cs.CL ↗ · 18h ago Cached

This paper evaluates the abilities of large language models (LLMs) and multimodal LLMs for addressee detection, turn-change prediction, and next speaker prediction in multi-party meeting conversations. Results show text-based LLMs outperform supervised models and humans in next speaker prediction, while multimodal LLMs improve over text-only models in other tasks but remain below human performance.

0 favorites 0 likes

#multi-party-conversation

GroupMemBench: Benchmarking LLM Agent Memory in Multi-Party Conversations

arXiv cs.CL ↗ · 2026-05-15 Cached

GroupMemBench is a new benchmark for evaluating LLM agent memory in multi-party conversations, exposing failures in current memory systems with the best achieving only 46% average accuracy.

0 favorites 0 likes

multi-party-conversation

Evaluating Large Language Models Abilities for Addressee, Turn-change, and Next Speaker Prediction in Meetings

GroupMemBench: Benchmarking LLM Agent Memory in Multi-Party Conversations

Submit Feedback