Sub-JEPA: Subspace Gaussian Regularization for Stable End-to-End World Models
Summary
The authors introduce Sub-JEPA, a method using Subspace Gaussian Regularization to improve the stability of end-to-end world models like LeWM, showing consistent performance gains on continuous-control benchmarks.
View Cached Full Text
Cached at: 05/12/26, 07:33 AM
Paper page - Sub-JEPA: Subspace Gaussian Regularization for Stable End-to-End World Models
Source: https://huggingface.co/papers/2605.09241 We’re releasing Sub-JEPA 🌐
LeWM (from LeCun’s group) is the first end-to-end trainable JEPA world model — it uses isotropic Gaussian regularization to prevent representation collapse. Clean and effective.
Our take: latent representations sit on low-dimensional manifolds, so enforcing a full-space Gaussian is too strong a bias.
We propose Subspace Gaussian Regularization: instead of constraining the full embedding space, we project latents into multiple orthogonal subspaces and apply Gaussian constraints there. Simple change, better inductive bias.
Results on 4 continuous-control benchmarks consistently outperform LeWM, with gains correlated to reductions in effective rank — the lower the task’s intrinsic dimensionality, the larger the gain.
Similar Articles
Sub-JEPA: a simple fix to LeCun group's LeWorldModel that consistently improves performance [P]
Sub-JEPA improves LeWorldModel by applying Gaussian regularization in frozen random orthogonal subspaces, consistently outperforming the original on benchmarks with up to +10.7 percentage points improvement.
LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels
LeWorldModel introduces a stable, end-to-end Joint-Embedding Predictive Architecture that trains directly from pixels with minimal hyperparameters and provable anti-collapse guarantees. It achieves significant speedups in planning compared to foundation models while maintaining competitive performance on robotic manipulation tasks.
@_akhaliq: VISReg Variance-Invariance-Sketching Regularization for JEPA training
Introduces VISReg, a regularization method for JEPA (Joint Embedding Predictive Architecture) training that combines variance, invariance, and sketching constraints.
Delta-JEPA: Learning Action-Sensitive World Models via Latent Difference Decoding
Delta-JEPA introduces a reconstruction-free world model that augments latent forward prediction with a Latent Difference Action Decoder to prevent collapse and improve action-sensitivity, achieving better planning performance on visual continuous-control tasks.
So, what is Yann LeCun's "World Models" and JEPA and is it Really a Replacement for LLMs?
Discusses Yann LeCun's 'World Models' and JEPA from a recent arXiv paper, clarifying that it is not a replacement for LLMs but a model optimized for visual processing in robotics, self-driving, and industrial controls.