inference-time-control

#inference-time-control

Inference-Time Budget Control for LLM Search Agents

arXiv cs.AI ↗ · yesterday Cached

This paper introduces a two-stage inference-time budget control method for LLM search agents, using Value-of-Information scores to optimize tool-call and token allocation during multi-hop question answering.

0 favorites 0 likes

inference-time-control

Inference-Time Budget Control for LLM Search Agents

Submit Feedback