Tag
This paper introduces a method that, without any retraining, makes Tiny Recursive Models stochastic at test time by adding Gaussian noise and running parallel rollouts, achieving dramatic performance gains on PPBench and Sudoku-Extreme.
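As a rough illustration of the idea, the sketch below perturbs a recursive model's latent state with Gaussian noise at inference time and draws several independent rollouts. It is a toy under stated assumptions, not the paper's implementation: `toy_step` is a hypothetical stand-in for a trained model's update, the point at which noise is injected is assumed, and the majority vote used to pick a final answer is an assumed selection rule that the summary does not specify. The rollouts are independent, so they could be batched in parallel; here they run in a loop for clarity.

```python
import math
import random
from collections import Counter


def toy_step(z, x):
    """Hypothetical stand-in for one recursive update of a trained model."""
    return [math.tanh(0.5 * zi + xi) for zi, xi in zip(z, x)]


def noisy_rollout(x, step_fn, n_steps, sigma, rng):
    """Run one rollout, adding Gaussian noise to the latent state before each update."""
    z = [0.0] * len(x)
    for _ in range(n_steps):
        # Test-time perturbation (assumed injection site): z <- z + N(0, sigma^2)
        z = [zi + rng.gauss(0.0, sigma) for zi in z]
        z = step_fn(z, x)
    # Discretise the final state into a candidate answer (one bit per position)
    return tuple(int(zi > 0) for zi in z)


def sample_answers(x, step_fn=toy_step, n_rollouts=8, n_steps=6, sigma=0.1, seed=0):
    """Draw n_rollouts independent noisy rollouts for the same input."""
    rng = random.Random(seed)
    return [noisy_rollout(x, step_fn, n_steps, sigma, rng) for _ in range(n_rollouts)]


def majority_answer(answers):
    """Assumed selection rule: return the most frequent candidate answer."""
    return Counter(answers).most_common(1)[0][0]
```

With `sigma=0.0` every rollout collapses to the same deterministic answer, which is the baseline behaviour; a positive `sigma` lets rollouts diverge, so a selection step has several candidates to choose from. The model weights are never updated, which is what "without retraining" refers to.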