@SaitoWu: A group at Baidu Research US predicted ten years ago: Don't bet all AI compute on NVIDIA. So they actually invested in a 'wafer-scale' chip company — Cerebras. In 2016, Zhou Nan left investment banking for Baidu's US AI research institute. Andrew Ng was leading the team, budgets were ample, GPUs were bought freely. Dario (An…
Summary
The article recounts Baidu Research US's investment in Cerebras, a wafer-scale chip company, a decade ago. It analyzes the shift in the AI chip market from training to inference and the importance of non-consensus investments.
View Cached Full Text
Cached at: 06/17/26, 06:02 PM
Ten years ago, a group at Baidu US AI research already anticipated: they couldn’t bet all AI compute on NVIDIA. And they ended up investing in a “wafer-scale” chip company — Cerebras.
In 2016, Zhou Nan jumped from investment banking to Baidu’s US AI research. At the time, Andrew Ng was leading the team, budgets were ample, and GPUs could be bought freely. Dario (Anthropic founder) and Greg Diamond were both there. The team was already vaguely sensing the contours of the scaling law: bigger models, more data, more compute → better results. The DeepSpeech2 paper was one of the earliest signals.
Zhou Nan’s first project was Cerebras. He scoured the world for training chips that were not NVIDIA, looked at Graphcore and Wave Computing, and finally chose Cerebras. Because this company was the most radical — they aimed to build a wafer-scale engine, turning an entire wafer into one massive AI compute engine, putting compute units and memory extremely close to dramatically reduce communication costs.
When they invested, Cerebras hadn’t even taped out; they only had a signature. Baidu US researchers directly verified it on what was then the world’s largest language model, and the signals were decent. The investment decision passed almost without resistance — Robin, Lu Qi, Robin Li all approved quickly. The valuation was already over $700M, expensive by 2017 standards, but Zhou Nan was betting on a “non-consensus” idea.
The real difficulty wasn’t the idea itself, but actually building the wafer-scale chip. Yield, heat dissipation, power supply, compiler — every piece was a hard nut. 2017-2019 was almost the darkest hour, with tape-out delayed again and again. But early investors like Benchmark, Foundation, and Eclipse stayed the course, giving enough patience. A chip really takes ten years to see results.
Now Cerebras’s opportunity lies in inference, not training. OpenAI signed at least a $20B deal, drawn by its low latency and high throughput. NVIDIA’s CUDA ecosystem is too strong — the switching cost for training is prohibitively high, but in inference scenarios, Cerebras’s architectural advantages start to show. In fact, Sam Altman personally invested in Cerebras in 2016, even earlier than Baidu.
Zhou Nan reflected: Baidu US research was essentially Silicon Valley’s “Whampoa Military Academy.” So many later founded startups: Inflection, Adept, Anthropic, Cohere… Unfortunately, due to geopolitics, Baidu later tried to raise a dedicated AI fund but didn’t succeed. OpenAI and Databricks were both on the list at the time, but none of the investments materialized.
Ten years ago, everyone worried about relying solely on NVIDIA, yet NVIDIA still became a de facto monopoly. But now the explosion in inference demand has opened a new window for heterogeneous chips. Cerebras is just the beginning; more new inference chip architectures are likely to emerge.
Zhou Nan now looks at AI investing: consensus comes too fast, early windows are getting shorter. He instead focuses on directions where consensus hasn’t formed yet, such as Physical AI (robotics) and new inference chip architectures. What moved me most in this episode was how it recreated that “gods clashing” atmosphere at Baidu US research a decade ago, when the scaling law was just emerging. So many things were foreseen early, but it takes ten years for them to truly land. The story of Cerebras is just one snapshot.
Similar Articles
@paulwalker99318: This LatePost interview is packed with information about Baidu US R&D, Scaling Laws, OpenAI, Anthropic, and Cerebras. > "Dario joining Baidu was a very important step in his career. He was recruited by Greg Diamos. And before joining Baidu, Dario didn't have a computer science or AI background — he came from math, physics, and biology. Greg Diamos saw his intuition for AI and ability to train models."
A summary of the LatePost interview, reviewing Baidu US R&D's early AI布局, including investing in Cerebras, nearly investing in OpenAI and Anthropic, and the flow of talent from Baidu to these companies.
@LuBtc888: 10 years ago, Justin Sun told everyone to pay attention to Nvidia. Later, Nvidia increased more than 200 times. Last year, he said: 'Short-term chip shortage, long-term energy shortage, always storage shortage.' As a result, the storage track also increased dozens of times. Recently, he also said: 'The dividend of virtual AI is almost over. The real main line in the next three years is physical AI!' Most importantly, he is not...
Justin Sun recently stated that the dividend of virtual AI is coming to an end, and the main line for the next three years is physical AI, and he has invested $1 billion. His past predictions (Nvidia, storage track) have all seen huge gains.
@seclink: https://x.com/seclink/status/2056711091129118741
In-depth interview with Jensen Huang, reviewing Nvidia's history from betting the company on CUDA to becoming the AI powerhouse, explaining the four scaling laws of AI and the development direction for the next decade, emphasizing compute bottlenecks and extreme co-design philosophy.
@snowboat84: https://x.com/snowboat84/status/2061962883651731602
This article is the first part of the AI Engineering Panorama series. From a historical perspective, it reviews the evolution of GPUs from gaming graphics cards to AI accelerators, the bold bet of CUDA, the independent path of Google's TPU, and why NVIDIA ultimately prevailed. It also provides a detailed analysis of the underlying logic of AI infrastructure such as chips, supply chain, networking, and power.
@dongxi_nlp: A very valuable article, the last 6 takeaways are worth pondering. Among them, the last two: 5. The data industry is far from developed. Anthropic and OpenAI spend over $10 million on a single environment, while Chinese AI labs have a 'build rather than buy' mentality. 6. Countless...
The article summarizes the current state of the AI data industry, pointing out that the data industry is not yet mature. Anthropic and OpenAI spend over $10 million on a single environment, while Chinese AI labs tend to build rather than buy. In addition, many labs have access to Huawei chips but still crave more Nvidia chips.