Tag
A new feature called OpenResearch allows reproducing and experimenting on papers, with a one-click template to train Vector Policy Optimization (VPO) on ToolRL, enabling diverse answer generation and improved test-time search.
Andrej Karpathy open-sourced an autonomous research agent that runs its own ML experiments overnight using a single GPU, automatically iterating on improvements by editing code and keeping changes that lower validation loss.