adaptive-sampling

#adaptive-sampling

Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling

Hugging Face Daily Papers ↗ · 2026-06-02 Cached

This paper formulates adaptive sampling for large language models as a Markov decision process and trains a lightweight RL controller to balance correctness, latency, and computational cost, achieving improved trade-offs.

0 favorites 0 likes

#adaptive-sampling

NOVA: Fundamental Limits of Knowledge Discovery Through AI

arXiv cs.AI ↗ · 2026-05-18 Cached

The NOVA framework models the 'generate, verify, accumulate, retrain' loop as an adaptive sampling process over a knowledge space, identifying failure modes and proving a scaling law for cumulative generation cost under Zipf-like discovery distributions.

0 favorites 0 likes

adaptive-sampling

Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling

NOVA: Fundamental Limits of Knowledge Discovery Through AI

Submit Feedback