Tag
This paper proposes a system that combines a prerequisite knowledge graph with a PPO-based policy to structure Socratic tutoring with LLMs, showing improved student mastery and efficiency over heuristic and frontier model baselines.