enterprise-domains

Tag

Cards List
#enterprise-domains

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Hugging Face Daily Papers · 2026-05-13 Cached

EVA-Bench introduces a comprehensive end-to-end framework for evaluating voice agents, simulating realistic multi-turn conversations and measuring performance across voice-specific failure modes with novel accuracy (EVA-A) and experience (EVA-X) metrics. The benchmark includes 213 scenarios across enterprise domains and a perturbation suite for accent and noise robustness, revealing substantial gaps in current systems.

0 favorites 0 likes
← Back to home

Submit Feedback