Tag
Discusses the persistent challenges that prevent AI agents from reliably handling real-world tasks, such as changing websites and inconsistent workflows, despite progress in task execution.
This paper introduces engagement forecasting for intelligent tutoring systems, predicting weekly minutes practiced and new skills mastered using interaction logs from 425 middle-school students. Feature-based models reduce error by 22-33% over heuristic baselines, offering explainable patterns for tutor-learner goal setting.
Fields Medalist Timothy Gowers reports using GPT5.5 Pro to solve open mathematical problems and predicts an imminent crisis in mathematical research due to rapid AI progress.