Dota 2 with large scale deep reinforcement learning

OpenAI Blog Papers

Summary

OpenAI Five became the first AI system to defeat Dota 2 world champions using large-scale deep reinforcement learning with self-play, demonstrating superhuman performance on a complex game with long time horizons and imperfect information.

No content available
Original Article
View Cached Full Text

Cached at: 04/20/26, 02:52 PM

# Dota 2 with large scale deep reinforcement learning Source: [https://openai.com/index/dota-2-with-large-scale-deep-reinforcement-learning/](https://openai.com/index/dota-2-with-large-scale-deep-reinforcement-learning/) ## Abstract On April 13th, 2019, OpenAI Five became the first AI system to defeat the world champions at an esports game\. The game of Dota 2 presents novel challenges for AI systems such as long time horizons, imperfect information, and complex, continuous state\-action spaces, all challenges which will become increasingly central to more capable AI systems\. OpenAI Five leveraged existing reinforcement learning techniques, scaled to learn from batches of approximately 2 million frames every 2 seconds\. We developed a distributed training system and tools for continual training which allowed us to train OpenAI Five for 10 months\. By defeating the Dota 2 world champion \(Team OG\), OpenAI Five demonstrates that self\-play reinforcement learning can achieve superhuman performance on a difficult task\.

Similar Articles

Dota 2

OpenAI Blog

OpenAI created a bot that defeats world-class Dota 2 professionals in 1v1 matches using only self-play learning, without imitation learning or tree search. The achievement demonstrates progress toward AI systems that can accomplish complex goals in dynamic, multi-agent environments.

OpenAI Five defeats Dota 2 world champions

OpenAI Blog

OpenAI Five becomes the first AI to defeat world-champion esports professionals in Dota 2, winning two back-to-back matches against OG at the OpenAI Five Finals. The breakthrough was achieved through unprecedented scaling of training compute rather than novel algorithms, and the team is retiring OpenAI Five while announcing plans to deploy it for public internet play.

OpenAI Five

OpenAI Blog

OpenAI Five is a reinforcement learning agent that masters Dota 2 through self-play training with curriculum learning and strategic randomization, progressing from random behavior to executing complex human-level strategies.

OpenAI Five Benchmark

OpenAI Blog

OpenAI Five completed a benchmark match against humans in Dota 2, demonstrating improved capabilities including expanded hero pool (18 heroes), Roshan pit mechanics, and wards. The system shows general training flexibility in acquiring complex game skills.

The International 2018: Results

OpenAI Blog

OpenAI Five competed against top professional Dota 2 teams at The International 2018, losing both matches against elite human players while demonstrating competitive gameplay and strategic depth developed through self-taught learning.