@AdinaYakup: dots.tts New TTS from Xiaohongshu (RedNote) 2B - Apache 2.0 Fully continuous architecture (no codec tokens) 48kHz synth…

X AI KOLs Following 06/05/26, 03:43 PM Models

text-to-speech tts xiaohongshu open-source apache-2.0 zero-shot voice-cloning

Summary

dots.tts is a new 2B-parameter TTS model from Xiaohongshu (RedNote), released under the Apache 2.0 license. It uses a fully continuous architecture (no codec tokens), synthesizes audio at 48kHz, and supports zero-shot voice cloning.

dots.tts 🔊 New TTS from Xiaohongshu (RedNote) ✨ 2B - Apache 2.0 ✨ Fully continuous architecture (no codec tokens) ✨ 48kHz synthesis ✨ Zero-shot voice cloning https://t.co/0GUYbzgm6M

Cached at: 06/05/26, 05:19 PM

Similar Articles

dots.tts 2B🎙️ SOTA TTS from RedNote

Reddit r/LocalLLaMA

RedNote releases dots.tts, a 2B parameter open-source text-to-speech model with zero-shot voice cloning and 48 kHz synthesis.

dots.tts Technical Report

Hugging Face Daily Papers

dots.tts presents a 2B-parameter continuous autoregressive TTS model trained on multilingual data, achieving state-of-the-art performance on benchmarks like Seed-TTS-Eval with low-latency streaming via CFG-aware MeanFlow distillation. The model, code, and checkpoints are released under Apache 2.0.

@Honcia13: Open-source TTS is going crazy! New weapons for industrial park scams? Tsinghua OpenBMB just released VoxCPM2: 2 billion parameters, trained on 2 million hours of multilingual data, with 48kHz studio-quality sound! The most intense part: no tokenizer needed at all. It performs diffusion autoregression directly in continuous latent space, maximizing detail retention!

X AI KOLs Timeline

Tsinghua University's OpenBMB has released VoxCPM2, an open-source multilingual TTS model with 2 billion parameters. It generates speech by diffusion autoregression in a continuous latent space, without a tokenizer, and offers 48kHz studio-quality audio along with voice cloning and voice design capabilities.

@akshay_pachaar: this TTS model generates speech 167x faster than you can hear it. Supertonic is an on-device TTS engine that runs via O…

X AI KOLs Following

Supertonic is a new open-source TTS engine that runs on-device via ONNX, supporting 31 languages and outperforming ElevenLabs in speed, even on a Raspberry Pi without a GPU.

@AdinaYakup: Mega-ASR https://huggingface.co/zhifeixie/Mega-ASR… 1.7B Apache 2.0 Built for Noise/Reverb/Clipping/Overlapping speaker…