multi-shot

#multi-shot

jdopensource/JoyAI-Echo

Hugging Face Models Trending · 2026-06-02

JD Open Source releases JoyAI-Echo (Echo-LongVideo), a text-to-audio-video diffusion model that generates minute-long multi-shot videos with consistent character identity and voice, and uses DMD distillation for a 7.5x speedup.

MSAVBench: Towards Comprehensive and Reliable Evaluation of Multi-Shot Audio-Video Generation

Hugging Face Daily Papers · 2026-05-19

MSAVBench is the first comprehensive benchmark and adaptive evaluation framework for multi-shot audio-video generation, assessing 19 models across diverse tasks and achieving high alignment with human judgment.
