dots.tts 2B🎙️ SOTA TTS from RedNote

Reddit r/LocalLLaMA Models

Summary

RedNote releases dots.tts, a 2B parameter open-source text-to-speech model with zero-shot voice cloning and 48 kHz synthesis.

🔗 Blog: https://rednote-hilab.github.io/dots.tts-demo/ 🔗 GitHub: https://github.com/rednote-hilab/dots.tts 🔗 Technical Report: https://arxiv.org/abs/2608.16894 dots.tts 🎙️ New open-source TTS from RedNote (Xiaohongshu) ✨ 2B parameters (Apache 2.0) ✨ Fully continuous architecture (no codec tokens) ✨ 48 kHz synthesis ✨ Zero-shot voice cloning ✨ Direct text → speech (no phoneme pipeline)
Original Article

Similar Articles

dots.tts Technical Report

Hugging Face Daily Papers

dots.tts presents a 2B-parameter continuous autoregressive TTS model trained on multilingual data, achieving state-of-the-art performance on benchmarks like Seed-TTS-Eval with low-latency streaming via CFG-aware MeanFlow distillation. The model, code, and checkpoints are released under Apache 2.0.

Aratako/Irodori-TTS-500M-v3

Hugging Face Models Trending

Irodori-TTS-500M-v3 is a Japanese TTS model based on Rectified Flow Diffusion Transformer, supporting zero-shot voice cloning and unique emoji-based style/sound effect control.

OpenBMB/VoxCPM

GitHub Trending (daily)

OpenBMB releases VoxCPM2, a 2B-parameter tokenizer-free TTS model trained on 2M+ hours of multilingual speech data, supporting 30 languages, voice design, controllable cloning, and 48kHz output.