Teacher–student curriculum learning

OpenAI Blog Papers

Summary

OpenAI proposes Teacher–Student Curriculum Learning (TSCL), a framework where a Teacher algorithm automatically selects subtasks for a Student to learn complex tasks, optimizing based on learning curve slope and preventing forgetting. The approach matches or surpasses hand-crafted curricula on decimal addition and Minecraft navigation tasks, enabling solutions previously impossible with direct training.

No content available
Original Article Export to Word Export to PDF
View Cached Full Text

Cached at: 04/20/26, 02:45 PM

# Teacher–student curriculum learning Source: [https://openai.com/index/teacher-student-curriculum-learning/](https://openai.com/index/teacher-student-curriculum-learning/) OpenAI## Abstract We propose Teacher–Student Curriculum Learning \(TSCL\), a framework for automatic curriculum learning, where the Student tries to learn a complex task and the Teacher automatically chooses subtasks from a given set for the Student to train on\. We describe a family of Teacher algorithms that rely on the intuition that the Student should practice more those tasks on which it makes the fastest progress, i\.e\. where the slope of the learning curve is highest\. In addition, the Teacher algorithms address the problem of forgetting by also choosing tasks where the Student's performance is getting worse\. We demonstrate that TSCL matches or surpasses the results of carefully hand\-crafted curricula in two tasks: addition of decimal numbers with LSTM and navigation in Minecraft\. Using our automatically generated curriculum enabled to solve a Minecraft maze that could not be solved at all when training directly on solving the maze, and the learning was an order of magnitude faster than uniform sampling of subtasks\.

Similar Articles

Gathering human feedback

OpenAI Blog

OpenAI releases RL-Teacher, an open-source tool for training AI systems through human feedback instead of hand-crafted reward functions, with applications to safe AI development and complex reinforcement learning problems.

Interpretable machine learning through teaching

OpenAI Blog

OpenAI presents a machine teaching approach where a teacher neural network learns to select the most illustrative examples to teach a student network to recognize concepts, producing interpretable results by grounding examples in human-understandable properties rather than arbitrary feature encodings.

SchoolAI builds an AI platform that empowers teachers

OpenAI Blog

SchoolAI launches an AI platform built on OpenAI models that empowers teachers with real-time student progress signals and provides personalized learning support through conversational assistants (Dot) and AI tutors (Sidekick), reaching 1 million classrooms across 80+ countries in two years.

Learning to play Minecraft with Video PreTraining

OpenAI Blog

OpenAI introduced Video PreTraining (VPT), a semi-supervised method that trains neural networks to play Minecraft by learning from 70,000 hours of unlabeled human gameplay video combined with a small labeled dataset. The model learns complex sequential tasks using the native human interface (keyboard and mouse) and demonstrates capabilities like crafting diamond tools and pillar jumping, representing progress toward general computer-using agents.

Interpretable and pedagogical examples

OpenAI Blog

Research showing that iterative training of student-teacher neural networks produces interpretable teaching strategies, with the teacher learning to select or generate pedagogical examples that humans can understand and learn from effectively.