dots.tts 2B🎙️ SOTA TTS from RedNote

Reddit r/LocalLLaMA 06/05/26, 08:21 PM Models

tts text-to-speech open-source zero-shot voice-cloning apache-2.0 rednote

Summary

RedNote releases dots.tts, a 2B parameter open-source text-to-speech model with zero-shot voice cloning and 48 kHz synthesis.

🔗 Blog: https://rednote-hilab.github.io/dots.tts-demo/ 🔗 GitHub: https://github.com/rednote-hilab/dots.tts 🔗 Technical Report: https://arxiv.org/abs/2608.16894 dots.tts 🎙️ New open-source TTS from RedNote (Xiaohongshu) ✨ 2B parameters (Apache 2.0) ✨ Fully continuous architecture (no codec tokens) ✨ 48 kHz synthesis ✨ Zero-shot voice cloning ✨ Direct text → speech (no phoneme pipeline)

Original Article

Similar Articles

@AdinaYakup: dots.tts New TTS from Xiaohongshu (RedNote) 2B - Apache 2.0 Fully continuous architecture (no codec tokens) 48kHz synth…

X AI KOLs Following

Dots.tts is a new TTS model from Xiaohongshu (RedNote) with 2B parameters, Apache 2.0 license, fully continuous architecture without codec tokens, 48kHz synthesis, and zero-shot voice cloning.

dots.tts Technical Report

Hugging Face Daily Papers

dots.tts presents a 2B-parameter continuous autoregressive TTS model trained on multilingual data, achieving state-of-the-art performance on benchmarks like Seed-TTS-Eval with low-latency streaming via CFG-aware MeanFlow distillation. The model, code, and checkpoints are released under Apache 2.0.

Aratako/Irodori-TTS-500M-v3

Hugging Face Models Trending

Irodori-TTS-500M-v3 is a Japanese TTS model based on Rectified Flow Diffusion Transformer, supporting zero-shot voice cloning and unique emoji-based style/sound effect control.

OpenBMB/VoxCPM

GitHub Trending (daily)

OpenBMB releases VoxCPM2, a 2B-parameter tokenizer-free TTS model trained on 2M+ hours of multilingual speech data, supporting 30 languages, voice design, controllable cloning, and 48kHz output.

@akshay_pachaar: this TTS model generates speech 167x faster than you can hear it. Supertonic is an on-device TTS engine that runs via O…