@laowangbabababa: Shocked! Dr. Qi on Douyin sells a 500k digital human agent per day, and I built it in 2 minutes. Using the Pixelle-Video project, which already has 22k stars. It supports digital human lip-syncing, motion transfer, and image-to-video. Supports ComfyUI, input a topic, from script writing to adding...

X AI KOLs Timeline Tools

Summary

Introducing the open-source project Pixelle-Video: a fully automated AI short video engine. Input a topic and it automatically generates a video with script, images, voiceover, and background music. Supports local and cloud models, modular design allows flexible replacement of each component model.

Shocked! Dr. Qi on Douyin sells a 500k digital human agent per day, and I built it in 2 minutes. Using the Pixelle-Video project, which already has 22k stars. It supports digital human lip-syncing, motion transfer, and image-to-video. Supports ComfyUI, input a topic, from script writing to adding BGM to final output, fully automated video production. Old Wang deployed it locally and made a short video, which is an illustration-style video. If you need more complex videos, you need to configure cloud models like seedance2, kling, etc. The best part of Pixelle-Video is that it makes video production fully configurable, supporting both locally deployed models and cloud-based large models. Script, visuals, voiceover, and editing are split into four replaceable modules. Behind each module, you can switch models freely, making it very convenient. > Script layer: LLM reads the topic and outputs a structured script with timestamps, each sentence corresponding to a segment of visuals. > Visual layer: Each sentence of the script is converted into an image generation prompt, sent to ComfyUI or directly to DashScope to generate images. Image-to-video and digital human lip-syncing also go through this layer. > Voiceover layer: The original script goes through TTS synthesis, supporting multiple languages and voice cloning, no need to record your own audio. > Compositing layer: Synchronize visuals with the voiceover timeline, overlay BGM, and output MP4. Repo: http://github.com/AIDC-AI/Pixelle-Video… P.S. Think of it and you can produce a video. With a little AI programming, this project can be customized into a digital human agent suitable for various industries.
Original Article
View Cached Full Text

Cached at: 06/14/26, 07:39 AM

🎬 Pixelle-Video — AI Fully Automatic Short Video Engine

English | 中文

👤 Digital Human Lip Sync

Korean Digital Human Lip Sync

🖼️ Image-to-Video

Cartoon Video

💃 Motion Transfer

Dancing Cat

🌄 Humanities Documentary - Default Video Template

The scenery along the journey is mesmerizing

🔍 Cultural Deconstruction - Default Video Template

Santa ID

🔭 Scientific Speculation - Default Video Template

Why haven’t we found alien civilizations yet?

🌱 Personal Growth - Cloned Voice

How to improve yourself

🧠 Deep Thinking - Default Template

How to understand antifragility

🏯 History and Culture - Fixed Frame

Zizhi Tongjian

☀️ Emotional - Cloned Voice

Warm Winter Sun

📜 Novel Explanation - Original Script

Battle Through the Heavens

🧬 Knowledge Popularization - Qwen Image Generation

Wellness Tips

💰 Side Hustle - Movie Template

Earning Extra Income on the Side

🏛️ History Commentary - Custom Template

Zizhi Tongjian Revelations

Similar Articles

@Russell3402: Alibaba International's open-source AI-powered fully automated short video engine, Pixelle-Video. Simply input a topic, and it automatically generates a complete short video. From copywriting and voiceovers to image selection and editing, everything is handled by AI. GitHub:

X AI KOLs Timeline

Alibaba International has open-sourced the AI-powered fully automated short video engine Pixelle-Video, allowing users to generate complete short videos—including copywriting, voiceovers, images, and editing—by simply inputting a topic.

@yhslgg: Old Yang shares another gem open-source tool—KrillinAI, 10,000 stars on GitHub, a must-see for multilingual audio/video content! In a nutshell: from video download to subtitle translation, AI dubbing, video compositing, the entire pipeline is covered, and it can even auto-generate platform covers, supporting Bilibili, Douyin, Xiaohongshu, YouTube…

X AI KOLs Timeline

KrillinAI is an open-source tool that integrates the entire workflow of video downloading, subtitle translation, AI dubbing, and video compositing. It supports context-aware translation, voice cloning, auto layout, and cover generation, and is compatible with multiple AI models, suitable for multilingual audio/video content creation and distribution.

AIDC-AI/Pixelle-Video

GitHub Trending (daily)

Pixelle-Video is an open-source, fully-automated short-video engine that turns a single topic into a complete video with AI-generated script, images, voice-over, BGM and editing, built on a modular ComfyUI workflow.

@QT9277: "No way, AI voice synthesis has gotten this insane???" I was browsing GitHub today and was completely stunned. VoxCPM2, trending #1, over 20k stars, blowing up overseas. I thought it was another PPT open-source project, but after carefully checking the demo—my ears really couldn't tell which one was real. …

X AI KOLs Timeline

Introducing VoxCPM2, a completely free for commercial use, open-source multilingual voice synthesis model supporting voice design, cloning, and 48kHz high-quality output, ranked #1 on GitHub trending.