Tag
Built an open-source pipeline that takes a single sentence and produces a cinematic reel with characters, animation, music, and narration, using FLUX.2, Wan2.2, and other models on a single AMD GPU. The pipeline includes a director agent, character generation, keyframe animation, vision critic, music, and narration stages.
Black Forest Labs shared the evolution of the Flux series models at the AI Engineer Conference and released the SelfFlow research paper, proposing a self-supervised multimodal training method that does not require external encoders.
Hugging Face introduces Modular Diffusers, a new framework for building diffusion pipelines using composable, reusable building blocks instead of monolithic pipeline implementations. The system allows flexible mixing and matching of components for image generation workflows, with integration support for visual workflow tools like Mellon.