@paulwalker99318: This LatePost interview is packed with information about Baidu US R&D, Scaling Laws, OpenAI, Anthropic, and Cerebras. > "Dario joining Baidu was a very important step in his career. He was recruited by Greg Diamos. And before joining Baidu, Dario didn't have a computer science or AI background — he came from math, physics, and biology. Greg Diamos saw his intuition for AI and ability to train models."

X AI KOLs Timeline News

Summary

A summary of the LatePost interview, reviewing Baidu US R&D's early AI布局, including investing in Cerebras, nearly investing in OpenAI and Anthropic, and the flow of talent from Baidu to these companies.

This LatePost interview is packed with information about Baidu US R&D, Scaling Laws, OpenAI, Anthropic, and Cerebras. > "Dario joining Baidu was a very important step in his career. He was recruited by Greg Diamos. And before joining Baidu, Dario didn't have a computer science or AI background — he came from math, physics, and biology. Greg Diamos saw his intuition for AI and ability to train models." > Ten years ago (before the Transformer), Baidu was already training language models with nearly 300 million parameters — around the time the seeds of the Scaling Law were discovered. Training took over three months on GPUs, and the model was based on Baidu's own framework — PaddlePaddle. > Sam Altman himself is an investor in Cerebras. Baidu invested in 2017; Sam Altman invested in 2016. > Baidu's early investment in Cerebras was decided in just two days by Robin Li, Qi Lu, and the CFO — demonstrating how forward-looking Baidu's investment vision was at the time. > Baidu once had the chance to be an early angel investor in OpenAI and Anthropic. At that time, companies like OpenAI, Databricks, and Scale AI were all on Baidu's potential investment list. Unfortunately, the deteriorating US-China relationship prevented the deals from going through. > Qi Lu was once Sam Altman's mentor. After leaving Baidu in May 2018, Qi Lu accepted Sam Altman's invitation in August of the same year to become the founder and CEO of YC China. > In the summer of 2020, some former Baidu employees at OpenAI said GPT-3 was about to be trained — the thing they wanted to do back at Baidu was about to be achieved at OpenAI: a Wikipedia-level language model. At that time, GPT-3 was still in post-training, more than two years before ChatGPT came out. > At its peak, Baidu US R&D had at least 250 people, with very high talent density — even higher than Google DeepMind at times. Many came because of Andrew Ng. > Many of these people later joined core AI startups or founded their own. Besides those at OpenAI and Anthropic mentioned earlier, some co-founded Adept, xAI, and others became key members of labs like Meta FAIR. https://mp.weixin.qq.com/s/INA72IMxDN6kzz3r30xonA?scene=1&click_id=5…
Original Article
View Cached Full Text

Cached at: 06/23/26, 01:56 AM

The LatePost interview with LateTask is packed with information about Baidu Research US, the Scaling Law, OpenAI, Anthropic, and Cerebras.

“Dario joining Baidu was actually a very important step in his career. He was recruited by Greg Diamos. And before joining Baidu, Dario was not a computer science or AI professional; his background was in math, physics, and biology. Greg Diamos recognized that he had great AI intuition and skill in training models.”

Ten years ago (before the Transformer), Baidu was already training a language model with nearly 300 million parameters — around the time the Scaling Law was first taking shape. Training it on GPUs took over three months. The model was based on Baidu’s in-house framework, PaddlePaddle.

Sam Altman himself is an investor in Cerebras. Baidu invested in 2017, while Sam Altman invested in 2016.

Baidu’s early investment in Cerebras was decided in just two days by Robin Li, Qi Lu, and the CFO — a testament to Baidu’s forward-looking investment vision at the time.

Baidu once had the opportunity to be an early angel investor in OpenAI and Anthropic. At that time, companies like OpenAI, Databricks, and Scale AI were all on Baidu’s investment shortlist. Unfortunately, the deteriorating US-China relations prevented the deals from going through.

Qi Lu was an early mentor to Sam Altman. After leaving Baidu in May 2018, Qi Lu accepted Sam Altman’s invitation in August of the same year to become the founder and CEO of YC China.

In the summer of 2020, former Baidu employees working at OpenAI said that GPT-3 was about to be trained — that what they had wanted to do at Baidu was almost achieved at OpenAI: a Wikipedia-level language model. At that time, GPT-3 was still in the post-training phase, more than two years before ChatGPT was released.

At its peak, Baidu Research US had at least 250 people, with a very high density of talent — even higher than at Google DeepMind at the time. Many had come specifically to work with Andrew Ng.

Later, many of these people joined core AI startups or started their own companies. Besides the aforementioned OpenAI and Anthropic, some co-founded Adept and xAI, while others became key members of labs like Meta FAIR.

https://mp.weixin.qq.com/s/INA72IMxDN6kzz3r30xonA?scene=1&click_id=5…


Source: https://mp.weixin.qq.com/mp/wappoc_appmsgcaptcha?poc_token=HN_nOWqjwKkHjALFjtc9LKfcaHCSUsr27JmZf5yO&target_url=https%3A%2F%2Fmp.weixin.qq.com%2Fs%2FINA72IMxDN6kzz3r30xonA%3Fscene%3D1%26click_id%3D5

Environment Anomaly

The current environment is anomalous. Please complete the verification to continue accessing.

Similar Articles

@SaitoWu: A group at Baidu Research US predicted ten years ago: Don't bet all AI compute on NVIDIA. So they actually invested in a 'wafer-scale' chip company — Cerebras. In 2016, Zhou Nan left investment banking for Baidu's US AI research institute. Andrew Ng was leading the team, budgets were ample, GPUs were bought freely. Dario (An…

X AI KOLs Timeline

The article recounts Baidu Research US's investment in Cerebras, a wafer-scale chip company, a decade ago. It analyzes the shift in the AI chip market from training to inference and the importance of non-consensus investments.

@WSInsights: A 25-year-old podcast host over the past two years has interviewed the key figures from top AI labs like OpenAI, Anthropic, and DeepMind. Karpathy, Hassabis, Dario Amodei, Ilya Sutskever — all the big names in the field...

X AI KOLs Timeline

25-year-old podcast host Dwarkesh Patel has interviewed key figures from top AI labs including OpenAI, Anthropic, and DeepMind, such as Karpathy, Hassabis, Dario Amodei, and Ilya Sutskever. He publicly shared his AI-assisted "one-week preparation" workflow: having AI列出必读资料, tracking gaps in understanding, using AI to map out the full landscape, and implementing the code himself. Time magazine included him in the "AI 100" list for 2024.

@beefnoode: Last night I had a coffee chat with @Charles77xixi, and we really enjoyed the conversation. Teacher Charles is relatively young, but has already achieved great results in the AI field. He graduated from a top school and entered a top-tier big company, and now works at a star AI startup. In terms of technology, even though I…

X AI KOLs Following

The blogger shares insights from a coffee chat with AI practitioner Charles, discussing the choice between big companies and AI startups, entrepreneurial paths, differences between B2B and B2C markets, and the importance of networking and resources.

@ba_niu80557: Let's talk some hardcore practical knowledge while I have time this morning. What actually happens between signing a contract for an AI project and it finally running in production? I'll lay out the entire playbook. People in this field can copy it directly, and those not in it can still understand why 95% of enterprise AI pilots end up dead. First, let me say something counterintuitive to the point you might not believe...

X AI KOLs Timeline

This article discusses common reasons for the failure of enterprise AI projects from proof-of-concept to production deployment, highlighting key practices such as MLOps, early inspection of real data, and clear human-machine boundaries. It argues that project failures are often not due to model issues but due to neglect of the engineering implementation phase.

@AlchainHust: Spent most of a day listening to the 4-hour interview between Zhang Xiaojun and Yao Shunyu. This guy, who just moved from Anthropic to Google DeepMind last year, has worked on Claude 3.7/4.5 and Gemini 3. He offered many candid perspectives from a frontline researcher at a top large model lab. The interview is incredibly information-dense…

X AI KOLs Timeline

This article summarizes Zhang Xiaojun's interview with Yao Shunyu, a researcher involved in developing Claude and Gemini, who shares 10 insightful views on AI code generation, company culture, scaling laws, etc.