@hank_aibtc: https://x.com/victormustar/status/2058492201261244458/video/1… Holy cow! Meituan crushes commercial closed-source Avatar, open-source free LongCat-Video-Avatar-1.5 is here! …
Summary
Meituan has open-sourced the LongCat-Video-Avatar-1.5 model, which generates realistic talking videos from a single photo and a voice clip, handles multiple languages and long videos, and is claimed to outperform commercial closed-source solutions.
Cached at: 05/25/26, 04:44 AM
Holy shit! Meituan just wrecked commercial closed-source avatars:
the open-source, free LongCat-Video-Avatar-1.5 is here!
Drop in a photo + a voice clip (Chinese, English, Japanese — any language works),
and it instantly generates a talking video with perfectly synced lips, natural blinking and head movements, and wild hand gestures.
Long videos stay stable, multi-person conversations keep each person separate,
it even handles singing and dancing, and works with anime, animals, and real people!
All those problems HeyGen, Kling, etc. used to have — lip-sync failures, face drift, English-only?
All gone.
Now open-source under MIT, runs locally, batch generation is a breeze!
Content creators, e-commerce sellers, virtual lecturers, YouTubers who don’t want to show their faces, multilingual marketers… this is a productivity jackpot!
Core Idea:
LongCat-Video-Avatar-1.5 is ideal for talking-head avatars (digital human avatars), especially for e-commerce marketing.
Use case: Input a reference image + an audio clip (recorded script), and generate a product promotion video with natural lip-sync and stable identity consistency.
Advantages: Supports long-video continuation, multi-person dialogues, and multilingual speech with no identity drift, which makes it a good fit for live-stream replays or pre-rendered short videos.
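The post mentions running locally and batch-generating videos from a reference image plus an audio clip. As a rough sketch of how such a batch could be prepared, the Python below pairs images and audio clips that share a file name and builds a job list. It does not call the model: the `build_manifest` helper, the directory names, and the manifest fields are all hypothetical, and the actual inference entry point should be taken from the project's own README.

```python
import json
from pathlib import Path

# Assumed file types; adjust to whatever the real pipeline accepts.
IMAGE_EXTS = {".png", ".jpg", ".jpeg", ".webp"}
AUDIO_EXTS = {".wav", ".mp3", ".flac"}


def build_manifest(image_dir, audio_dir, output_dir):
    """Pair each reference image with the audio clip that shares its file stem.

    Returns (jobs, unmatched): a list of job dicts and the sorted stems that
    had an image or an audio clip but not both.
    """
    images = {p.stem: p for p in Path(image_dir).iterdir()
              if p.suffix.lower() in IMAGE_EXTS}
    audios = {p.stem: p for p in Path(audio_dir).iterdir()
              if p.suffix.lower() in AUDIO_EXTS}

    jobs = []
    for stem in sorted(images.keys() & audios.keys()):
        jobs.append({
            "image": str(images[stem]),   # reference photo
            "audio": str(audios[stem]),   # recorded script
            "output": str(Path(output_dir) / f"{stem}.mp4"),
        })

    # Stems present on only one side, so nothing is skipped silently.
    unmatched = sorted(images.keys() ^ audios.keys())
    return jobs, unmatched


if __name__ == "__main__":
    # Hypothetical folder layout: products/images/shoes.png, products/audio/shoes.wav, ...
    jobs, unmatched = build_manifest("products/images", "products/audio", "products/videos")
    print(json.dumps(jobs, indent=2))
    if unmatched:
        print("No matching pair for:", ", ".join(unmatched))
```

Each entry in the resulting list can then be handed to whatever inference script the project provides, one image and audio pair per product video.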
Project + HF Demo below:
Similar Articles
meituan-longcat/LongCat-Video-Avatar-1.5 · Hugging Face
LongCat-Video-Avatar 1.5 is an upgraded open-source framework for audio-driven human video generation with improved lip synchronization, production-ready stability, and efficient 8-step inference.
@victormustar: New: LongCat just dropped an excellent open-source talking-avatar model (probably SOTA) + MIT licensed Made a Hugging F…
LongCat released an open-source talking-avatar model (likely state-of-the-art) under MIT license, with a Hugging Face demo, enabling various applications like AI tutors, dubbing, and coding agents.
@Saboo_Shubham_: INSANE...this is an Open Source Video model available for free on Hugging Face. LongCat just dropped an amazing video a…
LongCat has released an open source video avatar model on Hugging Face that is free to use and capable of impressive feats.
@CopyRebeldia: A Chinese lab has just humiliated half the video industry. You upload a photo and an audio, and out comes an avatar spe…
A Chinese lab has released LongCat-Avatar, an open-source tool that generates an audio-synced avatar from a photo and an audio clip, revolutionizing video production.
@QT9277: "No way, AI voice synthesis has gotten this insane???" I was browsing GitHub today and was completely stunned. VoxCPM2, trending #1, over 20k stars, blowing up overseas. I thought it was another PPT open-source project, but after carefully checking the demo—my ears really couldn't tell which one was real. …
Introducing VoxCPM2, an open-source multilingual voice synthesis model that is completely free for commercial use, supports voice design, cloning, and 48kHz high-quality output, and is ranked #1 on GitHub trending.