path-pruning

Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning

arXiv cs.CL · 2026-04-20

This paper proposes STOP (SuperTOken for Pruning), a systematic framework for pruning inefficient reasoning paths early in parallel reasoning with Large Reasoning Models. The method achieves superior efficiency and effectiveness across models from 1.5B to 20B parameters, boosting GPT-OSS-20B accuracy on AIME25 from 84% to 90% under fixed compute budgets.
