scaling-law

Tag

Cards List
#scaling-law

EdgeBench Reveals the Next Scaling Law: On-the-Fly AI Learning Speed Doubles Every 3 Months

Reddit r/singularity · 17h ago

EdgeBench reveals a new scaling law indicating that on-the-fly AI learning speed doubles every three months.

0 favorites 0 likes
#scaling-law

Data-driven Machine Learning Cannot Reach Symbolic-level Logical Reasoning -- The Limit of the Scaling Law

arXiv cs.AI · 2026-06-26 Cached

The paper argues that data-driven machine learning systems, including GPT-5, cannot achieve symbolic-level logical reasoning through scaling alone, due to inherent limitations in distinguishing logical structures from statistical regularities.

0 favorites 0 likes
#scaling-law

@AYi_AInotes: Everyone is raving about Japan's Fugu beating GPT on benchmarks, but I bet 99% of people haven't understood what really makes it mind-blowing. First off, this isn't some giant monolithic model at all—it has only 0.6B parameters and essentially works as an AI project manager. It handles simple tasks on its own, automatically splits complex ones, and selects the most suitable models from a global pool of top-tier models...

X AI KOLs Timeline · 2026-06-23 Cached

Sakana AI releases Fugu, a multi-agent orchestration system with only 0.6B parameters. By intelligently splitting tasks and coordinating multiple models, it achieves state-of-the-art performance while bypassing traditional parameter scaling. This marks the transition of multi-agent orchestration from a lab curiosity to a practical productivity tool.

0 favorites 0 likes
#scaling-law

@paulwalker99318: This LatePost interview is packed with information about Baidu US R&D, Scaling Laws, OpenAI, Anthropic, and Cerebras. > "Dario joining Baidu was a very important step in his career. He was recruited by Greg Diamos. And before joining Baidu, Dario didn't have a computer science or AI background — he came from math, physics, and biology. Greg Diamos saw his intuition for AI and ability to train models."

X AI KOLs Timeline · 2026-06-22 Cached

A summary of the LatePost interview, reviewing Baidu US R&D's early AI布局, including investing in Cerebras, nearly investing in OpenAI and Anthropic, and the flow of talent from Baidu to these companies.

0 favorites 0 likes
#scaling-law

@SaitoWu: A group at Baidu Research US predicted ten years ago: Don't bet all AI compute on NVIDIA. So they actually invested in a 'wafer-scale' chip company — Cerebras. In 2016, Zhou Nan left investment banking for Baidu's US AI research institute. Andrew Ng was leading the team, budgets were ample, GPUs were bought freely. Dario (An…

X AI KOLs Timeline · 2026-06-17 Cached

The article recounts Baidu Research US's investment in Cerebras, a wafer-scale chip company, a decade ago. It analyzes the shift in the AI chip market from training to inference and the importance of non-consensus investments.

0 favorites 0 likes
#scaling-law

The Weight Norm Sets the Grokking Timescale: A Causal Delay Law

arXiv cs.LG · 2026-06-15 Cached

This paper demonstrates that the weight norm causally controls the timescale of grokking in neural networks, reconciling conflicting accounts. Through interventions, it shows that grokking follows an exponential delay law and that norm magnitude dominates grokking time over learning rate across architectures.

0 favorites 0 likes
#scaling-law

@gntalktalk: This is the best methodology in AI development recently: The scaling law of software engineering --- Channel AI founder Luke Orthwine proposes a new paradigm: ditch the single-threaded linear "chess thinking" and switch to a high-concurrency, macro-scheduling, saturation-attack "real-time strategy game…

X AI KOLs Timeline · 2026-06-14 Cached

Channel AI founder Luke Orthwine proposes a new software development methodology: shifting programming thinking from traditional chess-like single-threaded linear thinking to real-time strategy game (RTS) style high concurrency, macro scheduling, and saturation attack to achieve efficient development in the AI Agent era.

0 favorites 0 likes
#scaling-law

@snowboat84: https://x.com/snowboat84/status/2062686432335184321

X AI KOLs Timeline · 2026-06-05 Cached

This article explores the deep connections between physics and deep learning, analyzes the isomorphism of phenomena such as Scaling Law and emergence with concepts like critical scaling laws and phase transitions in physics, and reviews the current status and prospects of applying physical methodologies in AI.

0 favorites 0 likes
#scaling-law

Streaming Communication in Multi-Agent Reasoning

Hugging Face Daily Papers · 2026-06-03 Cached

StreamMA introduces a streaming communication paradigm for multi-agent reasoning that pipelines intermediate results to reduce latency and improve effectiveness by leveraging more reliable early steps, outperforming baselines across benchmarks and revealing a step-level scaling law.

0 favorites 0 likes
#scaling-law

Another ‘DeepSeek moment’? Huawei milestone alters China trajectory in chip race: analysts.

Reddit r/ArtificialInteligence · 2026-05-31 Cached

Huawei unveils the Tau Scaling Law, a chip architectural workaround to bypass US sanctions and achieve 1.4nm-equivalent transistor density by 2031, marking a significant step toward China's semiconductor self-sufficiency and altering the tech rivalry with Washington.

0 favorites 0 likes
#scaling-law

@snowboat84: Today, let's discuss something hardcore. One question: what level of mathematics does AI use? From the perspective of tools and models themselves, the mathematics used by AI has an average age of 150 years, with most being from before the mid-19th century: matrix multiplication, gradient descent, chain rule, Fourier transform, inner product, probability — mostly content from the first two years of undergraduate studies. But some phenomena emerging from AI...

X AI KOLs Timeline · 2026-05-23 Cached

Discusses that the mathematics used by AI is mainly linear algebra, calculus, etc., from before the 19th century, but emerging phenomena such as Scaling Law, emergent abilities, double descent, in-context learning, and representation geometry lack mathematical explanation. Analogizes to the clouds in physics in 1900, suggesting it may drive the development of 21st-century mathematics.

0 favorites 0 likes
#scaling-law

@jinchenma_ai: I watched Xiaojun Zhang's interview with Yao Shunyu for 4 hours, packed with valuable insights. He made a particularly contrarian judgment. Many say pre-training has hit a wall and Scaling Law has reached its limit. He says no, and there are no signs of hitting a ceiling in the coming months. So why do so many people think it's hit a wall? He directly said: the vast majority of people who shout about hitting a wall have bugs in their own code...

X AI KOLs Timeline · 2026-05-21 Cached

In the interview, Yao Shunyu proposed a contrarian view that pre-training has not hit a wall and Scaling Law has not reached its limit, claiming that most people who say it has hit a wall have bugs in their code.

0 favorites 0 likes
#scaling-law

@Valley101_Qian: Congrats to Yuandong @tydsh. At the end of our previous interview, the "new direction" he mentioned was officially announced today: neolab Recursive_SI, with $650 million in funding and a valuation of $4.65 billion. Looking forward to more research freedom and research taste in the industry...

X AI KOLs Timeline · 2026-05-14 Cached

After being laid off from Meta, Yuandong announced a new direction, raising $650 million to found neolab Recursive_SI with a valuation of $4.65 billion. In an interview, he shared insights on AI trends, LLM limitations, reinforcement learning, and research freedom.

0 favorites 0 likes
← Back to home

Submit Feedback