Tag
Summary of David Silver's Reinforcement Learning Lecture 8 on integrating learning and planning, covering model-based RL and AlphaGo's use of policy and value networks with Monte Carlo Tree Search.
NVIDIA and David Silver's Ineffable Intelligence have partnered to build the infrastructure for large-scale reinforcement learning, focusing on pipelines that generate data on the fly and leverage NVIDIA's next-generation platforms.