Dota 2 with large scale deep reinforcement learning

OpenAI Blog 12/13/19, 08:00 AM Papers

reinforcement-learning dota-2 deep-learning game-ai openai-five self-play superhuman-performance

Summary

OpenAI Five became the first AI system to defeat Dota 2 world champions using large-scale deep reinforcement learning with self-play, demonstrating superhuman performance on a complex game with long time horizons and imperfect information.

Original Article

Cached at: 04/20/26, 02:52 PM

# Dota 2 with large scale deep reinforcement learning

Source: [https://openai.com/index/dota-2-with-large-scale-deep-reinforcement-learning/](https://openai.com/index/dota-2-with-large-scale-deep-reinforcement-learning/)

## Abstract

On April 13th, 2019, OpenAI Five became the first AI system to defeat the world champions at an esports game. The game of Dota 2 presents novel challenges for AI systems such as long time horizons, imperfect information, and complex, continuous state-action spaces, all challenges which will become increasingly central to more capable AI systems. OpenAI Five leveraged existing reinforcement learning techniques, scaled to learn from batches of approximately 2 million frames every 2 seconds. We developed a distributed training system and tools for continual training which allowed us to train OpenAI Five for 10 months. By defeating the Dota 2 world champion (Team OG), OpenAI Five demonstrates that self-play reinforcement learning can achieve superhuman performance on a difficult task.

Similar Articles

Dota 2

OpenAI Five defeats Dota 2 world champions

OpenAI Five

OpenAI Five Benchmark

The International 2018: Results
