@dwarkesh_sp: New blackboard lecture w @ericjang11 He walks through how to build AlphaGo from scratch, but with modern AI tools. Some…
Summary
A blackboard lecture by Eric Jang walks through building AlphaGo from scratch with modern AI tools, covering RL, MCTS, self-play, and connecting to LLM training, along with a discussion on automated AI research.
Similar Articles
@Michaelzsguo: This is one of the best deep discussions I've seen recently about the fundamentals of reinforcement learning and its relationship to modern AI. Eric Jang and Dwarkesh turned a seemingly retro exercise—rebuilding AlphaGo with today's tools—into a very clear masterclass: why 'search +...'
A detailed discussion on reinforcement learning and its connection to modern AI, using the reconstruction of AlphaGo with modern tools as a clear example of search and self-play. Key takeaways include neural network amortization of search, credit assignment challenges in LLMs vs AlphaGo, and implications for automated research.
@ericjang11: For the last few months I've been working on a from-scratch implementation of AlphaGo, a 2016 AI breakthrough that insp…
Eric Jang releases AutoGo, a from-scratch tutorial for implementing AlphaGo, including code and a playable bot, demonstrating that frontier capabilities can now be replicated affordably.
Building AlphaGo from scratch – Eric Jang
Eric Jang rebuilt AlphaGo from scratch and explained in detail the application of Monte Carlo Tree Search and deep learning in Go, demonstrating the feasibility of reproducing a powerful Go AI at low cost nowadays.
@enoch4306: You absolutely cannot miss this!! A tutorial on building AlphaGo from scratch with AI. Chinese subtitles. http://pan.quark.cn/s/06bd1030d137 .
Recommend a Chinese-subtitled tutorial on building AlphaGo from scratch, suitable for learning AI and reinforcement learning.
@ickma2311: David Silver RL Course (Lecture 8): Integrating Learning and Planning AlphaGo is a beautiful example of integrating lea…
Summary of David Silver's Reinforcement Learning Lecture 8 on integrating learning and planning, covering model-based RL and AlphaGo's use of policy and value networks with Monte Carlo Tree Search.