overfitting

#overfitting

A Training-Time Diagnostic for Generalization via the Log-Alignment Ratio

arXiv cs.LG ↗ · 2026-05-29 Cached

This paper introduces the log-alignment ratio (LAR), a training-time metric that measures parameter-activation alignment and predicts generalization by capturing the spread of weight and activation spectra. Experiments on grokking and a 3B-parameter language model show LAR tracks the transition from memorization to generalization and flags overfitting without held-out data.

0 favorites 0 likes

#overfitting

R2R2: Robust Representation for Intensive Experience Reuse via Redundancy Reduction in Self-Predictive Learning

arXiv cs.LG ↗ · 2026-05-15 Cached

Proposes R2R2, a regularization method for self-predictive learning in reinforcement learning to mitigate overfitting under high update-to-data ratios, achieving significant improvements on continuous control tasks.

0 favorites 0 likes

#overfitting

Dropping learning rate fixed my Qlora fine-tune more than anything else i tried

Reddit r/LocalLLaMA ↗ · 2026-05-14

A user found that reducing the learning rate from 2e-4 to 1e-4 significantly improved QLoRA fine-tuning of Llama 3.1 8B on a small dataset (8k samples), preventing overfitting and leading to better evaluation results.

0 favorites 0 likes

#overfitting

Scaling Laws for Mixture Pretraining Under Data Constraints

arXiv cs.LG ↗ · 2026-05-14 Cached

This paper studies the trade-off between scarce target data and abundant generic data in mixture pretraining, finding that repetition is a key driver of performance and that mixture training tolerates 15-20 repetitions of target data. It introduces a repetition-aware scaling law to optimize mixture configurations under data constraints.

0 favorites 0 likes

#overfitting

Prescriptive Scaling Laws for Data Constrained Training

Hugging Face Daily Papers ↗ · 2026-05-02 Cached

A modified scaling law accounting for data repetition effects provides compute-optimal training strategies for data-constrained scenarios, showing that beyond a point further repetition is counterproductive and compute is better spent on model capacity.

0 favorites 0 likes

#overfitting

Quantifying generalization in reinforcement learning

OpenAI Blog ↗ · 2018-12-06 Cached

OpenAI trained 9 agents on the CoinRun environment with varying numbers of training levels to quantify generalization in reinforcement learning, finding substantial overfitting even with 16,000 training levels and that IMPALA-CNN architectures generalize significantly better than Nature-CNN baselines.

0 favorites 0 likes

overfitting

A Training-Time Diagnostic for Generalization via the Log-Alignment Ratio

R2R2: Robust Representation for Intensive Experience Reuse via Redundancy Reduction in Self-Predictive Learning

Dropping learning rate fixed my Qlora fine-tune more than anything else i tried

Scaling Laws for Mixture Pretraining Under Data Constraints

Prescriptive Scaling Laws for Data Constrained Training

Quantifying generalization in reinforcement learning

Submit Feedback