self-supervised-learning

#self-supervised-learning

CF-JEPA: Mask-free forward prediction with asymmetric encoder utilization for time-series representation learning

arXiv cs.LG · 2026-06-08

Proposes CF-JEPA, a mask-free self-supervised learning framework for time-series that uses multi-horizon forward prediction from random crops and exploits asymmetry between online and target encoders for improved performance on classification, forecasting, and anomaly detection.

#self-supervised-learning

Predict and Reconstruct: Joint Objectives for Self-Supervised Language Representation Learning

arXiv cs.CL · 2026-06-05

The paper proposes a hybrid pre-training objective combining JEPA latent-space prediction with MLM reconstruction for language models, showing improved embedding uniformity and semantic-lexical balance.

#self-supervised-learning

The Loss Is Not Enough: Sampling Conditions and Inductive Bias in Contrastive Representation Learning

arXiv cs.LG · 2026-06-04

This paper develops a measure-theoretic framework for analyzing when contrastive learning recovers meaningful latent geometry. It introduces a 'diversity condition' on positive-pair sampling and a support-corrected InfoNCE variant, and its experiments indicate that sampling diversity and architectural inductive bias interact critically in contrastive representation learning.

#self-supervised-learning

Building The Ph(ysical)AI Layer Of Machine Intelligence

arXiv cs.LG · 2026-06-04

Researchers at MIT Lincoln Laboratory propose 'principle-driven foundation models' that encode signal-theoretic physical principles (Fourier decomposition, energy conservation, symmetry) instead of learning statistical correlations from large paired datasets. Trained exclusively on RF data, their frozen 1.99M-parameter encoder achieves 77.7% average accuracy across 15 diverse tasks spanning audio, images, text, and video, without any fine-tuning on the target domains.

#self-supervised-learning

Regret Pre-training: Bridging Prior and Posterior Views for Enhanced Knowledge Grounding

arXiv cs.CL · 2026-06-03

This paper introduces Regret Pre-training, a self-supervised framework that uses a dual-view architecture to incorporate future context into causal language model training, improving performance on downstream tasks by up to 18 percentage points without adding parameters.

#self-supervised-learning

@NielsRogge: NEPA has now been added here: Check the evals at the bottom to compare to other models

X AI KOLs Following · 2026-06-02

NEPA is a new method for visual self-supervised learning and generative pretraining that predicts the next embedding autoregressively, and has been added to a benchmark for evaluation.

#self-supervised-learning

When Softmax Fails at the Top: Extreme Value Corrections for InfoNCE

arXiv cs.LG · 2026-06-02

The paper identifies a misalignment between the softmax-based InfoNCE loss and the normalized embedding setting in modern contrastive learning. It proposes WEINCE, a simple modification that blends softmax logits with an endpoint shortfall correction using extreme value theory, yielding consistent improvements across vision benchmarks.
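
As a rough illustration of the setting this summary refers to, the sketch below computes standard InfoNCE on L2-normalized embeddings in plain Python. It does not reproduce the paper's WEINCE correction. The function names `info_nce` and `loss_floor`, the toy vectors, and the default temperature are assumptions made for this example. Because cosine similarities are confined to [-1, 1], the loss cannot reach zero at any finite temperature; `loss_floor` gives the best case for a single anchor, where the positive has similarity 1 and every negative has similarity -1.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def info_nce(anchor, positive, negatives, temperature=0.1):
    """Standard InfoNCE for one anchor, using cosine similarity as the logit.

    Illustrative only: this is the usual softmax form, not WEINCE.
    """
    a = l2_normalize(anchor)
    candidates = [l2_normalize(positive)] + [l2_normalize(n) for n in negatives]
    logits = [sum(x * y for x, y in zip(a, c)) / temperature for c in candidates]
    # Numerically stable log-sum-exp over the positive and all negatives.
    m = max(logits)
    log_denominator = m + math.log(sum(math.exp(l - m) for l in logits))
    # Negative log-probability assigned to the positive (index 0).
    return log_denominator - logits[0]


def loss_floor(num_negatives, temperature=0.1):
    """Smallest loss reachable with unit-norm embeddings.

    Best case: cos(anchor, positive) = 1 and cos(anchor, negative) = -1,
    which gives log(1 + N * exp(-2 / temperature)).
    """
    return math.log1p(num_negatives * math.exp(-2.0 / temperature))
```

Calling `info_nce([1.0, 0.0], [2.0, 0.0], [[-1.0, 0.0], [-3.0, 0.0]], temperature=0.5)` hits the floor returned by `loss_floor(2, 0.5)`, since rescaling the inputs has no effect after normalization.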

#self-supervised-learning

@alesfav: AI needs vastly more data than we do. One idea might close the gap: don't predict raw signals (tokens), predict your ow…

X AI KOLs Following · 2026-05-29

This thread presents a theoretical result showing that predicting abstract latent representations (as in JEPA and data2vec) instead of raw tokens can exponentially reduce the data gap between AI and human learning.
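
To make the distinction concrete, here is a minimal sketch of a latent-prediction objective of the kind used in JEPA-style and data2vec-style training. It does not reproduce the thread's theoretical result. The linear "encoders", the function names, and the decay value are all assumptions chosen for illustration. The point is only that the loss compares predicted and target representations rather than raw inputs, and that the target encoder is commonly maintained as an exponential moving average (EMA) of the online encoder.

```python
def linear(weights, x):
    """Apply a weight matrix (list of rows) to a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def latent_prediction_loss(online_w, predictor_w, target_w, context, target_view):
    """Mean squared error between predicted and target latents.

    The target latent is treated as a constant: in a real training loop
    no gradient would flow through the target encoder.
    """
    predicted = linear(predictor_w, linear(online_w, context))
    target = linear(target_w, target_view)
    return sum((p - t) ** 2 for p, t in zip(predicted, target)) / len(target)


def ema_update(target_w, online_w, decay=0.99):
    """Move the target encoder weights slowly toward the online encoder."""
    return [
        [decay * t + (1.0 - decay) * o for t, o in zip(t_row, o_row)]
        for t_row, o_row in zip(target_w, online_w)
    ]
```

With identity matrices for all three weight sets, `latent_prediction_loss` is zero when the context and target view coincide, and grows with the squared distance between their latents otherwise.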

#self-supervised-learning

Learning Robust and Task-Invariant Functional Representation from fMRI through Siamese Self-Supervised Learning

arXiv cs.LG · 2026-05-29

This paper introduces BrainSimSiam, a lightweight self-supervised framework using siamese networks to learn robust fMRI representations from positive-only pairs, achieving strong performance on downstream tasks even with limited data.

#self-supervised-learning

DIVE: Embedding Compression via Self-Limiting Gradient Updates

arXiv cs.CL · 2026-05-21

Proposes DIVE, a compression adapter for embedding dimensionality reduction that uses self-limiting gradient updates and head-wise NT-Xent contrastive loss to prevent overfitting on small datasets, outperforming existing methods on BEIR benchmarks.

#self-supervised-learning

Instance Discrimination for Link Prediction

arXiv cs.LG · 2026-05-21

This paper adapts instance discrimination self-supervised learning to link prediction in graphs, proposing new models L-GRACE and L-BGRL that operate on link representations and improve performance especially on unattributed graphs.

#self-supervised-learning

VCR: Learning Valid Contextual Representation for Incomplete Wearable Signals

arXiv cs.LG · 2026-05-20

VCR is a self-supervised framework that learns robust representations from incomplete wearable signals using orthogonal tokenization and missing-aware mixture-of-experts, improving performance under modality missingness.

#self-supervised-learning

Baba in Wonderland: Online Self-Supervised Dynamics Discovery for Executable World Models

arXiv cs.AI · 2026-05-19

Introduces Alice, a closed-loop system that learns executable world models online under prior misalignment by treating failed candidate updates as structural signal, achieving improved performance on a variant of Baba Is You with semantically remapped labels.

#self-supervised-learning

@xbresson: How do we design materials with AI? Excited to introduce Crys-JEPA, a new generative technique in collaboration w/ @liu…

X AI KOLs Following · 2026-05-19

Crys-JEPA introduces a joint embedding predictive architecture for crystals that learns an energy-aware latent space, achieving significant improvements in stability and novelty for de novo crystal discovery.

#self-supervised-learning

AudioMosaic: Contrastive Masked Audio Representation Learning

arXiv cs.LG · 2026-05-15

AudioMosaic introduces a contrastive learning-based audio encoder that uses structured time-frequency masking on spectrogram patches for efficient large-batch training, achieving state-of-the-art performance on audio benchmarks and improving audio-language models.

#self-supervised-learning

CSI-JEPA: Towards Foundation Representations for Ubiquitous Sensing with Minimal Supervision

arXiv cs.LG · 2026-05-15

CSI-JEPA is a self-supervised framework for learning reusable representations from unlabeled Wi-Fi channel state information, enabling label-efficient multi-task sensing. It achieves up to 98% label savings and outperforms supervised models.

#self-supervised-learning

A Unified Geometric Framework for Weighted Contrastive Learning

arXiv cs.LG · 2026-05-15

This paper introduces a unified geometric framework showing that weighted InfoNCE objectives can be interpreted as Distance Geometry Problems, providing exact characterizations of optimal embeddings for supervised and weakly supervised contrastive learning methods and revealing when such embeddings are geometrically realizable, degenerate, or inconsistent.

#self-supervised-learning

Network-Aware Bilinear Tokenization for Brain Functional Connectivity Representation Learning

arXiv cs.AI · 2026-05-15

NERVE proposes a network-aware bilinear tokenization method for self-supervised learning on brain functional connectivity matrices using masked autoencoders, improving representation learning across developmental cohorts.

#self-supervised-learning

HEPA: A Self-Supervised Horizon-Conditioned Event Predictive Architecture for Time Series

arXiv cs.LG · 2026-05-13

This paper introduces HEPA, a self-supervised architecture for predicting rare critical events in time series using a Joint-Embedding Predictive Architecture (JEPA) pretraining strategy. It demonstrates superior performance across multiple domains with significantly less labeled data and fewer tuned parameters than leading models.

#self-supervised-learning

GitHub - keon/jepa: implementing minimal versions of joint-embedding predictive architecture (JEPA)

Reddit r/ArtificialInteligence · 2026-05-12

A GitHub repository providing minimal, standalone PyTorch reimplementations of JEPA family models (I-JEPA, V-JEPA, V-JEPA 2, C-JEPA) for educational purposes, including tutorials and visualization tools.
