benchmark-study

#benchmark-study

The Routing Plateau: Understanding and Breaking the Accuracy Limits of LLM Routers

arXiv cs.LG ↗ · 2026-06-09 Cached

This paper identifies a 'routing plateau' phenomenon where diverse LLM routing methods converge to similar accuracy, far below the oracle, due to a predictability bottleneck that limits query-specific routing. It then shows that larger datasets, stronger encoders, and fine-tuning can help break through this plateau.

0 favorites 0 likes

#benchmark-study

Large-scale study of curiosity-driven learning

OpenAI Blog ↗ · 2018-08-13 Cached

OpenAI presents a large-scale empirical study of curiosity-driven reinforcement learning without extrinsic rewards across 54 benchmark environments, showing strong performance and investigating the role of feature spaces in prediction-based reward signals.

0 favorites 0 likes

benchmark-study

The Routing Plateau: Understanding and Breaking the Accuracy Limits of LLM Routers

Large-scale study of curiosity-driven learning

Submit Feedback