tool-augmented-llm

Tag

Cards List
#tool-augmented-llm

ToolMenuBench: Benchmarking Tool-Menu Filtering Strategies for Reliable and Efficient LLM Agents

arXiv cs.AI · 2026-06-16 Cached

ToolMenuBench is a benchmark for evaluating tool-menu filtering strategies in multi-step LLM agents. It shows that causal minimal tool filtering significantly improves task success and reduces token usage compared to unfiltered exposure.

0 favorites 0 likes
← Back to home

Submit Feedback