@HuggingPapers: ByteDance just dropped Bernini on Hugging Face Generate or edit videos from text, images, or references Rivals the best…

X AI KOLs Following 06/01/26, 03:37 PM Models

video-generation text-to-video image-to-video video-editing byte-dance open-source hugging-face

Summary

ByteDance released Bernini, an open-source video generation and editing model on Hugging Face that rivals top closed-source models.

ByteDance just dropped Bernini on Hugging Face Generate or edit videos from text, images, or references Rivals the best closed-source models out there https://t.co/267360jgvC

Original Article

View Cached Full Text

Cached at: 06/01/26, 05:31 PM

ByteDance just dropped Bernini on Hugging Face

Generate or edit videos from text, images, or references

Rivals the best closed-source models out there https://t.co/267360jgvC

Similar Articles

ByteDance/Bernini-R

Hugging Face Models Trending

ByteDance open-sourced Bernini-R, a video diffusion renderer that combines an MLLM-based semantic planner with a DiT-based renderer for unified video generation and editing, achieving top-tier performance on video editing.

@HuggingPapers: NVIDIA just released AnyFlow on Hugging Face The first any-step video diffusion model that generates high-quality text-…

X AI KOLs Following

NVIDIA released AnyFlow, the first any-step video diffusion model for text-to-video generation, allowing smooth quality scaling across inference budgets (4 to 50 steps).

@svpino: Huge leap in video generation! Look at the faces here. For the first time, we have a tool that doesn't change character…

X AI KOLs Following

BACH is introduced as a significant advancement in video generation, achieving unprecedented character consistency across scenes without face morphing or drift.

@hank_aibtc: https://x.com/victormustar/status/2058492201261244458/video/1… Holy cow! Meituan crushes commercial closed-source Avatar, open-source free LongCat-Video-Avatar-1.5 is here! …

X AI KOLs Timeline

Meituan open-sourced the LongCat-Video-Avatar-1.5 model, which supports generating realistic talking videos from a single photo and voice, supports multiple languages and long videos, and outperforms commercial closed-source solutions.

Show HN: Lance – image/video generation and understanding in one model

Hacker News Top

ByteDance releases Lance, a 3B parameter unified multimodal model supporting image and video generation, understanding, and editing, trained from scratch with a multi-task recipe.