@DengHokin: I am super excited to share that I launch a weekly Video Model Journal Club. Every week we pick one paper and go deep, …

X AI KOLs Timeline 06/16/26, 01:20 AM Events

video-models journal-club weekly-event world-models video-generation diffusion flow-matching

Summary

The author launches a weekly Video Model Journal Club covering video generation, world models, physical reasoning, diffusion, flow matching, etc. The first in-person talk will be by Yilun Du on Embodied Reasoning with World Models.

I am super excited to share that I launch a weekly Video Model Journal Club. Every week we pick one paper and go deep, i.e. video generation, world models, physical reasoning, diffusion, flow matching, and everything in between. This Friday, we will have Yilun Du @du_yilun from @Harvard giving us a talk on Embodied Reasoning with World Models in person at @moonlake - really grateful for Fan-yun Sun @sunfanyun, Charlotte @xia_char and Shin @shinshin_oob for hosting. Register for in-person via Luma: https://luma.com/video-model #video #AI #SF

Original Article

View Cached Full Text

Cached at: 06/16/26, 11:53 AM

This Friday, we will have Yilun Du @du_yilun from @Harvard giving us a talk on Embodied Reasoning with World Models in person at @moonlake - really grateful for Fan-yun Sun @sunfanyun, Charlotte @xia_char and Shin @shinshin_oob for hosting.

#video #AI #SF

Video Model Journal Club · Events Calendar

Source: https://luma.com/video-model Every week we pick one paper and go deep — video generation, world models, physical reasoning, diffusion, flow matching, and everything in between.

Events

Cover Image for Embodied Reasoning with World Models by Yilun Du

Embodied Reasoning with World Models by Yilun Du

By Hokin Deng, Fan-Yun Sun, Charlotte Xia, Shin & 2 others

San Francisco, United States

Cover Image for Think Visually, Reason Textually: Vision-Language Synergy in ARC by Beichen Zhang

Think Visually, Reason Textually: Vision-Language Synergy in ARC by Beichen Zhang

Cover Image for Demystifying Video Reasoning by Ruisi Wang

Demystifying Video Reasoning by Ruisi Wang

Cover Image for Video Reasoning Models by Zhongang Cai

Video Reasoning Models by Zhongang Cai

Cover Image for Video Models Can Reason with Verifiable Rewards by Tinghui Zhu

Video Models Can Reason with Verifiable Rewards by Tinghui Zhu

Cover Image for Video Models Are Zero-Shot Learners and Reasoners by Thaddäus Wiedemer

Video Models Are Zero-Shot Learners and Reasoners by Thaddäus Wiedemer

Cover Image for Do Joint Audio-Video Generation Models Understand Physics? by Zijun Cui

@DengHokin: I am super excited to share that I launch a weekly Video Model Journal Club. Every week we pick one paper and go deep, …

Video Model Journal Club · Events Calendar

Events

Embodied Reasoning with World Models by Yilun Du

Think Visually, Reason Textually: Vision-Language Synergy in ARC by Beichen Zhang

Demystifying Video Reasoning by Ruisi Wang

Video Reasoning Models by Zhongang Cai

Video Models Can Reason with Verifiable Rewards by Tinghui Zhu

Video Models Are Zero-Shot Learners and Reasoners by Thaddäus Wiedemer

Do Joint Audio-Video Generation Models Understand Physics? by Zijun Cui

Similar Articles

@swyx: full writeup and links here

@aiDotEngineer: Building Generative Image & Video models at Scale https://youtube.com/watch?v=xOP1PM8fwnk… A lot of interest in image g…

Why Video Agent models are next — Ethan He, xAI Grok Imagine (98 minute read)

@HuggingPapers: Top AI Papers of The Week (May 25-31): - Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players - SkillO…

Qwen's Embodied World Modeling (28 minute read)

Submit Feedback

Similar Articles

@swyx: full writeup and links here

@aiDotEngineer: Building Generative Image & Video models at Scale https://youtube.com/watch?v=xOP1PM8fwnk… A lot of interest in image g…

Why Video Agent models are next — Ethan He, xAI Grok Imagine (98 minute read)

@HuggingPapers: Top AI Papers of The Week (May 25-31): - Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players - SkillO…

Qwen's Embodied World Modeling (28 minute read)