regularization

Tag

Cards List
#regularization

Rotation-Preserving Supervised Fine-Tuning

arXiv cs.LG · 12h ago Cached

This paper introduces Rotation-Preserving Supervised Fine-Tuning (RPSFT), a method that improves out-of-domain generalization by preserving projected rotations in pretrained singular subspaces during fine-tuning.

0 favorites 0 likes
#regularization

FragileFlow: Spectral Control of Correct-but-Fragile Predictions for Foundation Model Robustness

arXiv cs.CL · yesterday Cached

This paper introduces FragileFlow, a plug-in regularizer that improves the robustness of LLMs and VLMs by controlling 'correct-but-fragile' predictions through spectral analysis and PAC-Bayes bounds.

0 favorites 0 likes
#regularization

Beyond Penalization: Diffusion-based Out-of-Distribution Detection and Selective Regularization in Offline Reinforcement Learning

arXiv cs.LG · yesterday Cached

This paper introduces DOSER, a framework using diffusion models for out-of-distribution detection and selective regularization in offline reinforcement learning. It aims to improve performance on static datasets by distinguishing between beneficial and detrimental OOD actions.

0 favorites 0 likes
#regularization

Sub-JEPA: Subspace Gaussian Regularization for Stable End-to-End World Models

Hugging Face Daily Papers · 3d ago Cached

The authors introduce Sub-JEPA, a method using Subspace Gaussian Regularization to improve the stability of end-to-end world models like LeWM, showing consistent performance gains on continuous-control benchmarks.

0 favorites 0 likes
#regularization

Understanding and Enforcing Weight Disentanglement in Task Arithmetic

Hugging Face Daily Papers · 2026-04-18 Cached

The paper introduces OrthoReg, a plug-and-play regularizer that enforces weight orthogonality during fine-tuning to improve task arithmetic and model merging without extra compute.

0 favorites 0 likes
#regularization

Adversarial training methods for semi-supervised text classification

OpenAI Blog · 2016-05-25 Cached

This paper presents adversarial and virtual adversarial training methods adapted for text classification by applying perturbations to word embeddings in RNNs rather than raw inputs. The approach achieves state-of-the-art results on semi-supervised and supervised text classification benchmarks while reducing overfitting.

0 favorites 0 likes
← Back to home

Submit Feedback