double-descent

Tag

Cards List
#double-descent

@snowboat84: To add a supplementary note, regarding the phenomena emerging from AI—scaling laws, emergence, double descent, representation geometry—the papers discussing them are already numerous. But there is a big problem: they are all thinking in the way of computer scientists, not physicists. What is a computer sci…

X AI KOLs Timeline · 2026-05-24 Cached

The author comments that current AI research overuses the thinking style of computer science and lacks a physics-based approach, proposing the need to establish an ideal system like 'Cyber Space' to lay a theoretical foundation.

0 favorites 0 likes
#double-descent

@snowboat84: Today, let's discuss something hardcore. One question: what level of mathematics does AI use? From the perspective of tools and models themselves, the mathematics used by AI has an average age of 150 years, with most being from before the mid-19th century: matrix multiplication, gradient descent, chain rule, Fourier transform, inner product, probability — mostly content from the first two years of undergraduate studies. But some phenomena emerging from AI...

X AI KOLs Timeline · 2026-05-23 Cached

Discusses that the mathematics used by AI is mainly linear algebra, calculus, etc., from before the 19th century, but emerging phenomena such as Scaling Law, emergent abilities, double descent, in-context learning, and representation geometry lack mathematical explanation. Analogizes to the clouds in physics in 1900, suggesting it may drive the development of 21st-century mathematics.

0 favorites 0 likes
#double-descent

Double descent for least-squares interpolation on contaminated data: A simulation study

arXiv cs.LG · 2026-05-22 Cached

This simulation study examines the double descent phenomenon for least-squares interpolation on contaminated data in linear regression, comparing the performance of the least-squares interpolator with robust alternatives.

0 favorites 0 likes
#double-descent

Deep double descent

OpenAI Blog · 2019-12-05 Cached

OpenAI research reveals the 'double descent' phenomenon where test error exhibits a non-monotonic pattern as both model size and training steps increase, challenging traditional understanding of the bias-variance tradeoff in deep learning.

0 favorites 0 likes
← Back to home

Submit Feedback