multi-party-conversation

Tag

Cards List
#multi-party-conversation

Evaluating Large Language Models Abilities for Addressee, Turn-change, and Next Speaker Prediction in Meetings

arXiv cs.CL · 18h ago Cached

This paper evaluates the abilities of large language models (LLMs) and multimodal LLMs for addressee detection, turn-change prediction, and next speaker prediction in multi-party meeting conversations. Results show text-based LLMs outperform supervised models and humans in next speaker prediction, while multimodal LLMs improve over text-only models in other tasks but remain below human performance.

0 favorites 0 likes
#multi-party-conversation

GroupMemBench: Benchmarking LLM Agent Memory in Multi-Party Conversations

arXiv cs.CL · 2026-05-15 Cached

GroupMemBench is a new benchmark for evaluating LLM agent memory in multi-party conversations, exposing failures in current memory systems with the best achieving only 46% average accuracy.

0 favorites 0 likes
← Back to home

Submit Feedback