tool-augmented-llm

#tool-augmented-llm

ToolMenuBench: Benchmarking Tool-Menu Filtering Strategies for Reliable and Efficient LLM Agents

arXiv cs.AI ↗ · 2026-06-16 Cached

ToolMenuBench is a benchmark for evaluating tool-menu filtering strategies in multi-step LLM agents. It shows that causal minimal tool filtering significantly improves task success and reduces token usage compared to unfiltered exposure.

0 favorites 0 likes

tool-augmented-llm

ToolMenuBench: Benchmarking Tool-Menu Filtering Strategies for Reliable and Efficient LLM Agents

Submit Feedback