@SaitoWu: A group at Baidu Research US predicted ten years ago: Don't bet all AI compute on NVIDIA. So they actually invested in a 'wafer-scale' chip company — Cerebras. In 2016, Zhou Nan left investment banking for Baidu's US AI research institute. Andrew Ng was leading the team, budgets were ample, GPUs were bought freely. Dario (An…

X AI KOLs Timeline 06/17/26, 02:07 PM News

baidu-america-research cerebras wafer-scale ai-chips inference venture-capital scaling-law

Summary

The article recounts Baidu Research US's investment in Cerebras, a wafer-scale chip company, a decade ago. It analyzes the shift in the AI chip market from training to inference and the importance of non-consensus investments.

A group at Baidu Research US predicted ten years ago: Don't bet all AI compute on NVIDIA. So they actually invested in a 'wafer-scale' chip company — Cerebras. In 2016, Zhou Nan left investment banking for Baidu's US artificial intelligence research institute. Andrew Ng was leading the team, budgets were ample, GPUs could be bought freely. Dario (Anthropic founder) and Greg Diamond were there too. The team had already vaguely sensed the shape of scaling laws: the larger the model, the more data, the more compute, the better the results. The DeepSpeech 2 paper was one of the earliest signals. Zhou Nan's first project was Cerebras. At the time, he searched globally for training chips 'not made by NVIDIA,' looked at Graphcore and Wave Computing, and ultimately chose Cerebras. Because this company was the most aggressive — they aimed to build a wafer-scale engine, turning an entire wafer into a massive AI compute engine, placing compute units and memory extremely close together to drastically reduce communication costs. When invested, Cerebras hadn't even taped out; they only had a signature. Baidu Research's researchers directly verified it, running it on the world's largest language model at the time, and the signals were decent. The investment decision faced almost no resistance — Robin, Lu Qi, and Robin Li quickly approved. The valuation was already over $700 million, expensive for 2017, but Zhou Nan was betting on 'non-consensus.' The truly hard part wasn't the idea, but building the wafer-scale. Yield, heat dissipation, power, compiler — each was a hard nut to crack. 2017 to 2019 were almost the darkest hours, with tapeouts delayed repeatedly. But early investors like Benchmark, Foundation, and Eclipse stayed with them, offering plenty of patience. Chips truly take ten years to show results. Now Cerebras's opportunity lies in inference, not training. OpenAI signed a deal worth at least $20 billion precisely for its low latency and high throughput. NVIDIA's CUDA ecosystem is too strong, making migration costs extremely high for training, but in inference scenarios, Cerebras's architecture advantages begin to shine. In fact, Sam Altman had personally invested in Cerebras in 2016, even earlier than Baidu. Zhou Nan reflects that Baidu Research US was like the 'Whampoa Military Academy' of Silicon Valley. Too many people later left to start companies: Inflection, Adept, Anthropic, Cohere… Unfortunately, due to geopolitical issues, Baidu later tried to raise a dedicated AI fund but didn't succeed. OpenAI and Databricks were on the list at the time, but ultimately no investments were made. Everyone worried ten years ago about relying solely on NVIDIA, yet NVIDIA still became a de facto monopoly. But now the explosion of inference demand has opened a new window for heterogeneous chips. Cerebras is just the beginning; more inference chips with new architectures may follow. Now Zhou Nan approaches AI investment: consensus comes too fast, early windows become shorter and shorter. Instead, he focuses on directions without consensus yet, such as Physical AI (robotics) and new inference chip architectures. What moved me most in this episode is how it recreates the atmosphere of that 'clash of titans' at Baidu Research US when scaling laws were just emerging. Many things were seen early on, but truly coming to fruition took ten years. Cerebras's story is just one example.

Original Article

View Cached Full Text

Cached at: 06/17/26, 06:02 PM

Ten years ago, a group at Baidu US AI research already anticipated: they couldn’t bet all AI compute on NVIDIA. And they ended up investing in a “wafer-scale” chip company — Cerebras.

In 2016, Zhou Nan jumped from investment banking to Baidu’s US AI research. At the time, Andrew Ng was leading the team, budgets were ample, and GPUs could be bought freely. Dario (Anthropic founder) and Greg Diamond were both there. The team was already vaguely sensing the contours of the scaling law: bigger models, more data, more compute → better results. The DeepSpeech2 paper was one of the earliest signals.

Zhou Nan’s first project was Cerebras. He scoured the world for training chips that were not NVIDIA, looked at Graphcore and Wave Computing, and finally chose Cerebras. Because this company was the most radical — they aimed to build a wafer-scale engine, turning an entire wafer into one massive AI compute engine, putting compute units and memory extremely close to dramatically reduce communication costs.

When they invested, Cerebras hadn’t even taped out; they only had a signature. Baidu US researchers directly verified it on what was then the world’s largest language model, and the signals were decent. The investment decision passed almost without resistance — Robin, Lu Qi, Robin Li all approved quickly. The valuation was already over $700M, expensive by 2017 standards, but Zhou Nan was betting on a “non-consensus” idea.

The real difficulty wasn’t the idea itself, but actually building the wafer-scale chip. Yield, heat dissipation, power supply, compiler — every piece was a hard nut. 2017-2019 was almost the darkest hour, with tape-out delayed again and again. But early investors like Benchmark, Foundation, and Eclipse stayed the course, giving enough patience. A chip really takes ten years to see results.

Now Cerebras’s opportunity lies in inference, not training. OpenAI signed at least a $20B deal, drawn by its low latency and high throughput. NVIDIA’s CUDA ecosystem is too strong — the switching cost for training is prohibitively high, but in inference scenarios, Cerebras’s architectural advantages start to show. In fact, Sam Altman personally invested in Cerebras in 2016, even earlier than Baidu.

Zhou Nan reflected: Baidu US research was essentially Silicon Valley’s “Whampoa Military Academy.” So many later founded startups: Inflection, Adept, Anthropic, Cohere… Unfortunately, due to geopolitics, Baidu later tried to raise a dedicated AI fund but didn’t succeed. OpenAI and Databricks were both on the list at the time, but none of the investments materialized.

Ten years ago, everyone worried about relying solely on NVIDIA, yet NVIDIA still became a de facto monopoly. But now the explosion in inference demand has opened a new window for heterogeneous chips. Cerebras is just the beginning; more new inference chip architectures are likely to emerge.

Zhou Nan now looks at AI investing: consensus comes too fast, early windows are getting shorter. He instead focuses on directions where consensus hasn’t formed yet, such as Physical AI (robotics) and new inference chip architectures. What moved me most in this episode was how it recreated that “gods clashing” atmosphere at Baidu US research a decade ago, when the scaling law was just emerging. So many things were foreseen early, but it takes ten years for them to truly land. The story of Cerebras is just one snapshot.

Similar Articles

@seclink: https://x.com/seclink/status/2056711091129118741

@snowboat84: https://x.com/snowboat84/status/2061962883651731602

@dongxi_nlp: A very valuable article, the last 6 takeaways are worth pondering. Among them, the last two: 5. The data industry is far from developed. Anthropic and OpenAI spend over $10 million on a single environment, while Chinese AI labs have a 'build rather than buy' mentality. 6. Countless...

Submit Feedback

Similar Articles

@paulwalker99318: This LatePost interview is packed with information about Baidu US R&D, Scaling Laws, OpenAI, Anthropic, and Cerebras. > "Dario joining Baidu was a very important step in his career. He was recruited by Greg Diamos. And before joining Baidu, Dario didn't have a computer science or AI background — he came from math, physics, and biology. Greg Diamos saw his intuition for AI and ability to train models."

@LuBtc888: 10 years ago, Justin Sun told everyone to pay attention to Nvidia. Later, Nvidia increased more than 200 times. Last year, he said: 'Short-term chip shortage, long-term energy shortage, always storage shortage.' As a result, the storage track also increased dozens of times. Recently, he also said: 'The dividend of virtual AI is almost over. The real main line in the next three years is physical AI!' Most importantly, he is not...

@seclink: https://x.com/seclink/status/2056711091129118741

@snowboat84: https://x.com/snowboat84/status/2061962883651731602

@dongxi_nlp: A very valuable article, the last 6 takeaways are worth pondering. Among them, the last two: 5. The data industry is far from developed. Anthropic and OpenAI spend over $10 million on a single environment, while Chinese AI labs have a 'build rather than buy' mentality. 6. Countless...