solver-amortized

Tag

Cards List
#solver-amortized

Breaking the Solver Bottleneck: Training Task Generators at the Learnable Frontier

arXiv cs.LG · yesterday Cached

Introduces PROPEL, a solver-amortized framework that trains a lightweight activation probe to predict solver pass rates, enabling efficient training of task generators for RL without costly solver rollouts. The method improves generation at the learnable frontier across math, code, and software-engineering tasks.

0 favorites 0 likes
← Back to home

Submit Feedback