FORTIS: Benchmarking Over-Privilege in Agent Skills

Hugging Face Daily Papers 05/09/26, 12:00 AM Papers

over-privilege agent-safety benchmark skill-layer privilege-escalation llm-agents

Summary

FORTIS benchmarks how LLM agents frequently exceed necessary privileges when selecting skills, showing over-privilege is the norm across ten frontier models and failing under realistic user interactions.

Large language model agents increasingly operate through an intermediate skill layer that mediates between user intent and concrete task execution. This layer is widely treated as an organizational abstraction, but we argue it is also a privilege boundary that current models routinely exceed. We present FORTIS, a benchmark that evaluates over-privilege in agent skills across two stages: whether a model selects the minimally sufficient skill from a large overlapping library, and whether it executes that skill without expanding into broader tools or actions than the skill permits. Across ten frontier models and three domains, we find that over-privileged behavior is the norm rather than the exception. Models consistently reach for higher-privilege skills and tools than the task requires, failing at both stages at rates that remain high even for the strongest available models. Failure is especially severe under the ordinary conditions of real user interaction: incomplete specification, convenience framing, and proximity to skill boundaries. None of these requires adversarial construction. The results indicate that the skill layer, far from containing agent behavior, is itself a primary source of privilege escalation in current systems.

Original Article

View Cached Full Text

Cached at: 05/12/26, 10:53 AM

Paper page - FORTIS: Benchmarking Over-Privilege in Agent Skills

Source: https://huggingface.co/papers/2605.09163 Authors:

Abstract

Large language model agents frequently exceed necessary privileges when selecting and executing skills, with performance declining under realistic user interaction conditions.

Large language model agentsincreasingly operate through an intermediateskill layerthat mediates between user intent and concrete task execution. This layer is widely treated as an organizational abstraction, but we argue it is also aprivilege boundarythat current models routinely exceed. We present FORTIS, a benchmark that evaluatesover-privilegeinagent skillsacross two stages: whether a model selects theminimally sufficient skillfrom a large overlapping library, and whether it executes that skill without expanding into broader tools or actions than the skill permits. Across ten frontier models and three domains, we find thatover-privileged behavior is the norm rather than the exception. Models consistently reach for higher-privilege skills and tools than the task requires, failing at both stages at rates that remain high even for the strongest available models. Failure is especially severe under the ordinary conditions of real user interaction: incomplete specification, convenience framing, and proximity to skill boundaries. None of these requires adversarial construction. The results indicate that theskill layer, far from containing agent behavior, is itself a primary source ofprivilege escalationin current systems.

View arXiv page View PDF Project page GitHub1 Add to collection

Models citing this paper0

No model linking this paper

Cite arxiv.org/abs/2605.09163 in a model README.md to link it from this page.

Datasets citing this paper0

No dataset linking this paper

Cite arxiv.org/abs/2605.09163 in a dataset README.md to link it from this page.

Spaces citing this paper0

No Space linking this paper

Cite arxiv.org/abs/2605.09163 in a Space README.md to link it from this page.

Collections including this paper0

No Collection including this paper

Add this paper to acollectionto link it from this page.

FORTIS: Benchmarking Over-Privilege in Agent Skills

Paper page - FORTIS: Benchmarking Over-Privilege in Agent Skills

Abstract

Models citing this paper0

Datasets citing this paper0

Spaces citing this paper0

Collections including this paper0

Similar Articles

When Lower Privileges Suffice: Investigating Over-Privileged Tool Selection in LLM Agents

SkillLearnBench: Benchmarking Continual Learning Methods for Agent Skill Generation on Real-World Tasks

The Capability Frontier: Benchmarks Miss 82% of Model Performance

SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

Act As a Real Researcher: A Suite of Benchmarks Evaluating Frontier LLMs and Agentic Harnesses in Research Lifecycle

Submit Feedback

Similar Articles

When Lower Privileges Suffice: Investigating Over-Privileged Tool Selection in LLM Agents

SkillLearnBench: Benchmarking Continual Learning Methods for Agent Skill Generation on Real-World Tasks

The Capability Frontier: Benchmarks Miss 82% of Model Performance

SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

Act As a Real Researcher: A Suite of Benchmarks Evaluating Frontier LLMs and Agentic Harnesses in Research Lifecycle