标签
Supertonic 3是一个99M参数的开源TTS模型,完全在设备上运行,在树莓派上击败了ElevenLabs,在笔记本电脑CPU上的性能是实时的167倍。
Supertonic is a lightning-fast, on-device TTS model with 99M parameters, supporting 31 languages. It runs locally with no API costs, outperforms cloud TTS on accuracy for numbers, phone numbers, and technical terms, and can be installed via Python, Node.js, Rust, Go, and more.
本文重点介绍了 Supertonic,这是一个开源的文本转语音模型,完全在本地设备运行。与 ElevenLabs 和 OpenAI 等基于云端的服务相比,它在速度和格式准确性方面均表现出色。