@HuggingPapers: ByteDance just dropped Bernini on Hugging Face Generate or edit videos from text, images, or references Rivals the best…
Summary
ByteDance released Bernini, an open-source video generation and editing model on Hugging Face that rivals top closed-source models.
View Cached Full Text
Cached at: 06/01/26, 05:31 PM
ByteDance just dropped Bernini on Hugging Face
Generate or edit videos from text, images, or references
Rivals the best closed-source models out there https://t.co/267360jgvC
Similar Articles
ByteDance/Bernini-R
ByteDance open-sourced Bernini-R, a video diffusion renderer that combines an MLLM-based semantic planner with a DiT-based renderer for unified video generation and editing, achieving top-tier performance on video editing.
@HuggingPapers: NVIDIA just released AnyFlow on Hugging Face The first any-step video diffusion model that generates high-quality text-…
NVIDIA released AnyFlow, the first any-step video diffusion model for text-to-video generation, allowing smooth quality scaling across inference budgets (4 to 50 steps).
@svpino: Huge leap in video generation! Look at the faces here. For the first time, we have a tool that doesn't change character…
BACH is introduced as a significant advancement in video generation, achieving unprecedented character consistency across scenes without face morphing or drift.
@hank_aibtc: https://x.com/victormustar/status/2058492201261244458/video/1… Holy cow! Meituan crushes commercial closed-source Avatar, open-source free LongCat-Video-Avatar-1.5 is here! …
Meituan open-sourced the LongCat-Video-Avatar-1.5 model, which supports generating realistic talking videos from a single photo and voice, supports multiple languages and long videos, and outperforms commercial closed-source solutions.
Show HN: Lance – image/video generation and understanding in one model
ByteDance releases Lance, a 3B parameter unified multimodal model supporting image and video generation, understanding, and editing, trained from scratch with a multi-task recipe.