Tag
EMMA is a physics-informed multimodal framework that recovers dynamical parameters from raw video, audio, and image data using a Liquid Time-Constant network and physics-constrained loss, outperforming existing baselines across diverse benchmarks.