@multimodalart: they extracted only the audio bit of LTX-2.3, fine-tuned for TTS task and achieved SOTA TTS emotional control??? try it…


Summary

ResembleAI fine-tuned the audio component of the LTX-2.3 model for text-to-speech and reportedly achieved state-of-the-art emotional control. The result is available to try as a Hugging Face Space called DramaBox.

they extracted only the audio bit of LTX-2.3, fine-tuned for TTS task and achieved SOTA TTS emotional control??? try it for yourself. So far I'm very impressed! https://t.co/A8qD9dc78b

Cached at: 05/15/26, 11:09 PM


DramaBox - a Hugging Face Space by ResembleAI

Source: https://huggingface.co/spaces/ResembleAI/Dramabox

Similar Articles

ResembleAI/Dramabox

Hugging Face Models Trending

Dramabox is an expressive text-to-speech model by Resemble AI that uses prompt-driven control for speaker identity, emotion, and delivery, with optional voice cloning via a 10-second reference. Built on the LTX-2.3 audio diffusion transformer, it is open-sourced on Hugging Face.

DramaBox by Resemble AI

Product Hunt

DramaBox by Resemble AI converts scene descriptions into AI-generated vocal performances.