power-law

#power-law

Scaling Laws, Carefully (25 minute read)

TLDR AI ↗ · yesterday Cached

A comprehensive overview of scaling laws in deep learning, tracing their theoretical roots and empirical findings, and explaining how loss decreases predictably with model size, data, and compute.

0 favorites 0 likes

#power-law

How LoRA Remembers? A Parametric Memory Law for LLM Finetuning

Hugging Face Daily Papers ↗ · 2026-05-28 Cached

This paper investigates the quantitative limits of parametric memory in LLMs using LoRA as a probe, establishing a power law relationship and introducing a threshold-guided optimization method called MemFT for improved memory performance.

0 favorites 0 likes

#power-law

Saturating Scaling Laws for Equational Discovery: A Phenomenology of Growth Dynamics in Three Toy Substrates with Two Real-World Replications

arXiv cs.AI ↗ · 2026-05-26 Cached

This paper investigates growth dynamics in deterministic equational discovery across three toy substrates and two real-world replications, finding substrate-conditional saturating power-law scaling.

0 favorites 0 likes

#power-law

Scaling laws for neural language models

OpenAI Blog ↗ · 2020-01-23 Cached

Foundational empirical study demonstrating power-law scaling relationships between language model performance and model size, dataset size, and compute budget, with implications for optimal training allocation and sample efficiency.

0 favorites 0 likes

power-law

Scaling Laws, Carefully (25 minute read)

How LoRA Remembers? A Parametric Memory Law for LLM Finetuning

Saturating Scaling Laws for Equational Discovery: A Phenomenology of Growth Dynamics in Three Toy Substrates with Two Real-World Replications

Scaling laws for neural language models

Submit Feedback