@songhan_mit: the causal version of SANA world model is released, enabling close to real-time inference on a single H100:
Summary
The causal version of the SANA world model has been released, enabling near real-time inference for video generation on a single H100 GPU, with open-source code and a demo.
View Cached Full Text
Cached at: 06/08/26, 05:25 PM
the causal version of SANA world model is released, enabling close to real-time inference on a single H100:
Enze Xie (@xieenze_jr): 🚀 Causal realtime streaming SANA-WM open-sourced!
Thanks to @reactorworld for serving the model — try demo: https://t.co/88zGRl9KL6
~0.93x realtime on single H100, watch 60s 720p live + 6-DoF camera control.
Code:
Similar Articles
@songhan_mit: Explore SANA World Model, using hybrid linear attention, efficient and fast!
SANA World Model is a new AI model that uses hybrid linear attention for efficiency and speed.
SANA-WM, a 2.6B open-source world model for 1-minute 720p video
SANA-WM is a 2.6 billion parameter open-source world model capable of generating 1-minute 720p videos.
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer
SANA-WM is a 2.6B-parameter open-source world model that generates high-fidelity 720p minute-scale videos with precise camera control, achieving industrial-level quality while significantly reducing computational requirements.
Efficient-Large-Model/SANA-WM_bidirectional
SANA-WM is an efficient 2.6B-parameter open-source world model for minute-scale video generation with precise camera control. It uses a hybrid linear diffusion transformer and a two-stage pipeline to produce 720p videos from images and text prompts.
@songhan_mit: SANA Streaming: V2V on a single 5090
SANA Streaming enables video-to-video generation on a single NVIDIA RTX 5090 GPU.