spoken-language-model


VITA-QinYu: Expressive Spoken Language Model for Role-Playing and Singing

arXiv cs.CL · 5d ago

VITA-QinYu is an expressive end-to-end spoken language model capable of role-playing and singing. It is trained on 15.8K hours of data and is reported to outperform peer models in expressiveness and conversational accuracy.


