benchmark-study


The Routing Plateau: Understanding and Breaking the Accuracy Limits of LLM Routers

arXiv cs.LG · 2026-06-09

This paper identifies a "routing plateau": diverse LLM routing methods converge to similar accuracy, far below the oracle, because a predictability bottleneck limits query-specific routing. It then shows that larger datasets, stronger encoders, and fine-tuning can help break through the plateau.


Large-scale study of curiosity-driven learning

OpenAI Blog · 2018-08-13

OpenAI presents a large-scale empirical study of curiosity-driven reinforcement learning without extrinsic rewards across 54 benchmark environments, showing strong performance and investigating the role of feature spaces in prediction-based reward signals.
