latent-representations

Tag

Cards List
#latent-representations

@alesfav: AI needs vastly more data than we do. One idea might close the gap: don't predict raw signals (tokens), predict your ow…

X AI KOLs Following · 2026-05-29 Cached

This thread presents a theoretical result showing that predicting abstract latent representations (as in JEPA and data2vec) instead of raw tokens can exponentially reduce the data gap between AI and human learning.

0 favorites 0 likes
#latent-representations

Learned Relay Representations for Forward-Thinking Discrete Diffusion Models

arXiv cs.LG · 2026-05-25 Cached

This paper introduces Learned Relay Representations (Relay), a method that allows masked diffusion models to propagate latent information across denoising steps, overcoming the hard reset problem and improving performance-latency trade-offs. The method is shown to outperform standard supervised finetuning on coding tasks while reducing inference latency by up to 32%.

0 favorites 0 likes
#latent-representations

Sub-JEPA: a simple fix to LeCun group's LeWorldModel that consistently improves performance [P]

Reddit r/MachineLearning · 2026-05-18

Sub-JEPA improves LeWorldModel by applying Gaussian regularization in frozen random orthogonal subspaces, consistently outperforming the original on benchmarks with up to +10.7 percentage points improvement.

0 favorites 0 likes
← Back to home

Submit Feedback