@AdinaYakup: dots.tts New TTS from Xiaohongshu (RedNote) 2B - Apache 2.0 Fully continuous architecture (no codec tokens) 48kHz synth…
Summary
dots.tts is a new 2B-parameter TTS model from Xiaohongshu (RedNote), released under the Apache 2.0 license. It uses a fully continuous architecture with no codec tokens, synthesizes 48kHz audio, and supports zero-shot voice cloning.
View Cached Full Text
Cached at: 06/05/26, 05:19 PM
dots.tts 🔊 New TTS from Xiaohongshu (RedNote)
✨ 2B - Apache 2.0 ✨ Fully continuous architecture (no codec tokens) ✨ 48kHz synthesis ✨ Zero-shot voice cloning https://t.co/0GUYbzgm6M
Similar Articles
dots.tts 2B🎙️ SOTA TTS from RedNote
RedNote releases dots.tts, a 2B parameter open-source text-to-speech model with zero-shot voice cloning and 48 kHz synthesis.
dots.tts Technical Report
dots.tts presents a 2B-parameter continuous autoregressive TTS model trained on multilingual data, achieving state-of-the-art performance on benchmarks like Seed-TTS-Eval with low-latency streaming via CFG-aware MeanFlow distillation. The model, code, and checkpoints are released under Apache 2.0.
@Honcia13: Open-source TTS is going crazy! A new weapon for scam compounds? Tsinghua's OpenBMB just released VoxCPM2: 2 billion parameters, trained on 2 million hours of multilingual data, with 48kHz studio-quality sound! The most striking part: no tokenizer at all. It performs diffusion autoregression directly in a continuous latent space, maximizing detail retention!
Tsinghua University's OpenBMB has released VoxCPM2, an open-source multilingual TTS model with 2 billion parameters. It generates speech by diffusion autoregression in a continuous latent space without a tokenizer, and offers 48kHz studio-quality audio along with voice cloning and voice design capabilities.
@akshay_pachaar: this TTS model generates speech 167x faster than you can hear it. Supertonic is an on-device TTS engine that runs via O…
Supertonic is a new open-source TTS engine that runs on-device via ONNX, supporting 31 languages and outperforming ElevenLabs in speed, even on a Raspberry Pi without a GPU.
@AdinaYakup: Mega-ASR https://huggingface.co/zhifeixie/Mega-ASR… 1.7B Apache 2.0 Built for Noise/Reverb/Clipping/Overlapping speaker…
Mega-ASR is a 1.7B parameter robust ASR model under Apache 2.0, designed for noisy, reverberant, and overlapping speech, with an audio quality router to handle clean vs degraded audio.