spoken-language-model

Tag

Cards List
#spoken-language-model

VITA-QinYu: Expressive Spoken Language Model for Role-Playing and Singing

arXiv cs.CL · 5d ago Cached

VITA-QinYu is an expressive end-to-end spoken language model capable of role-playing and singing, trained on 15.8K hours of data to outperform peers in expressiveness and conversational accuracy.

0 favorites 0 likes
← Back to home

Submit Feedback