@ChrisInterno: Signals of physical plausibility are hiding in the geometry of frozen image encoders. No video training. No physics sup…

X AI KOLs Following Papers

Summary

The tweet highlights a research finding that signals of physical plausibility can be extracted from the geometry of frozen image encoders without video training or physics supervision.

Signals of physical plausibility are hiding in the geometry of frozen image encoders. No video training. No physics supervision. https://t.co/NKmgD8g53f
Original Article
View Cached Full Text

Cached at: 06/23/26, 01:50 PM

Signals of physical plausibility are hiding in the geometry of frozen image encoders. No video training. No physics supervision. https://t.co/NKmgD8g53f

Similar Articles

Building The Ph(ysical)AI Layer Of Machine Intelligence

arXiv cs.LG

Researchers at MIT Lincoln Laboratory propose 'principle-driven foundation models' that encode signal-theoretic physical principles (Fourier decomposition, energy conservation, symmetry) instead of learning statistical correlations from large paired datasets. Trained exclusively on RF data, their 1.99M parameter frozen encoder achieves 77.7% average accuracy across 15 diverse tasks spanning audio, images, text, and video without any fine-tuning on target domains.

EgoPhys: Learning Generalizable Physics Models of Deformable Objects from Egocentric Video

Hugging Face Daily Papers

EgoPhys introduces a framework to construct deformable physical digital twins from egocentric RGB video using generalizable priors and a compact codebook, enabling zero-shot generalization to unseen objects without per-spring optimization. The system is demonstrated on a real robot, showing that egocentric human play video can serve as internal world representation for deformable-object planning.