paper-of-the-day

#paper-of-the-day

@ClementDelangue: Paper of the day! https://huggingface.co/papers/2605.13301…

X AI KOLs Following ↗ · 2026-05-15 Cached

A paper introduces a unified recipe (SU-01) that combines reverse-perplexity curriculum, two-stage reinforcement learning, and test-time scaling to achieve gold-medal-level performance on IMO and IPhO problems using a 30B-A3B backbone.

0 favorites 0 likes

paper-of-the-day

@ClementDelangue: Paper of the day! https://huggingface.co/papers/2605.13301…

Submit Feedback