Tag
A free, open-source AI engineering curriculum (AI Engineering from Scratch) with 20 phases and 416 lessons, covering topics from linear algebra to autonomous swarms and building from raw math up to code.
A paper introducing a unified recipe (SU-01) that combines a reverse-perplexity curriculum, two-stage reinforcement learning, and test-time scaling to achieve gold-medal-level performance on IMO and IPhO problems using a 30B-A3B backbone.
A paper presenting SU-01, a 30B-A3B reasoning model that achieves gold-medal-level performance on IMO and IPhO problems via a reverse-perplexity curriculum, two-stage reinforcement learning, and test-time scaling.
A curated collection of links to videos, repositories, guides, books, and papers for learning about AI, LLMs, and building AI agents.
OpenAI announces the completion of its Fall 2018 Fellows program and celebrates the fellows' research contributions. The organization has also open-sourced part of the fellowship curriculum, including 'Spinning Up in Deep RL,' an educational resource for learning reinforcement learning.