humanoid-motion

Tag

Cards List
#humanoid-motion

MotionVLA: Vision-Language-Action Model for Humanoid Motion

Hugging Face Daily Papers · 2026-06-13 Cached

Proposes MotionVLA, a vision-language-action model for humanoid motion generation using a dual-stream frequency tokenizer that separately encodes pose and physical dynamics, achieving better diversity and consistency.

0 favorites 0 likes
← Back to home

Submit Feedback