Tag
ToolMenuBench is a benchmark for evaluating tool-menu filtering strategies in multi-step LLM agents. It shows that causal minimal tool filtering significantly improves task success and reduces token usage compared to unfiltered exposure.