self-supervised

#self-supervised

MemTrain: Self-Supervised Context Memory Training

arXiv cs.CL ↗ · yesterday Cached

MemTrain proposes a self-supervised training framework that uses masked reconstruction and intermediate memory recall proxy tasks on Wikipedia corpora to enhance LLM agents' context memory, achieving up to 17.67 point gains on downstream memory-intensive QA benchmarks.

0 favorites 0 likes

#self-supervised

MindZero: Learning Online Mental Reasoning With Zero Annotations

arXiv cs.AI ↗ · 2d ago Cached

MindZero introduces a self-supervised reinforcement learning framework that trains multimodal large language models for efficient and robust online mental reasoning without requiring mental state annotations, outperforming model-based methods in accuracy and efficiency.

0 favorites 0 likes

#self-supervised

RayDer: Scalable Self-Supervised Novel View Synthesis from Real-World Video

Hugging Face Daily Papers ↗ · 6d ago Cached

RayDer is a unified feed-forward transformer that consolidates camera estimation, scene reconstruction, and rendering for self-supervised novel view synthesis from real-world video, achieving clean power-law scaling and strong zero-shot performance.

0 favorites 0 likes

#self-supervised

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement

Hugging Face Daily Papers ↗ · 6d ago Cached

The SAVE framework improves reward model training by using value functions to grade on-policy responses and update models through contrastive objectives, achieving outperforming results across six benchmarks.

0 favorites 0 likes

#self-supervised

ChildVox: A Speech, Audio, and Large Audio-Language Model Benchmark in Understanding and Characterizing Sound across Childhood

Hugging Face Daily Papers ↗ · 2026-05-28 Cached

ChildVox presents a comprehensive benchmark for analyzing children's acoustic communication across developmental stages, integrating over 20 sub-tasks from 17 child-centered audio and speech datasets.

0 favorites 0 likes

#self-supervised

PilotWiMAE: Pilot-Native Representation Learning for Wireless Channels

arXiv cs.AI ↗ · 2026-05-25 Cached

PilotWiMAE introduces a self-supervised framework that directly ingests noisy pilot observations for wireless channel representation learning, removing the unrealistic full-CSI assumption and enabling robust cross-frequency beam selection and channel estimation that beats supervised baselines.

0 favorites 0 likes

#self-supervised

Self-Improving In-Context Learning

arXiv cs.CL ↗ · 2026-05-25 Cached

This paper proposes a method to improve in-context learning by optimizing the continuous embeddings of a fixed few-shot prompt at test time, using a self-supervised confidence proxy derived from the model's log-probabilities without requiring fine-tuning or token generation.

0 favorites 0 likes

#self-supervised

NITP: Next Implicit Token Prediction for LLM Pre-training

Hugging Face Daily Papers ↗ · 2026-05-24 Cached

Next Implicit Token Prediction (NITP) enhances language model pre-training by adding dense continuous supervision in representation space, improving generalization and performance across model sizes with minimal computational overhead.

0 favorites 0 likes

#self-supervised

Temporal Contrastive Transformer for Financial Crime Detection: Self-Supervised Sequence Embeddings via Predictive Contrastive Coding

arXiv cs.LG ↗ · 2026-05-22 Cached

Introduces the Temporal Contrastive Transformer (TCT), a self-supervised framework for learning temporal embeddings from financial transactions for fraud detection. Achieves AUC 0.8644 with embeddings alone but does not improve over strong engineered features (AUC 0.9205 vs 0.9245), indicating learned representations overlap with existing features.

0 favorites 0 likes

#self-supervised

@stephenbtl: My talk at @aiDotEngineer is now online. I talked about our research and where @bfl_ml is heading. Thanks @swyx for the…

X AI KOLs Following ↗ · 2026-05-11 Cached

Black Forest Labs shared the evolution of the Flux series models at the AI Engineer Conference and released the SelfFlow research paper, proposing a self-supervised multimodal training method that does not require external encoders.

0 favorites 0 likes

self-supervised

Submit Feedback