skill-layer

FORTIS: Benchmarking Over-Privilege in Agent Skills

Hugging Face Daily Papers · 2026-05-09

FORTIS benchmarks how often LLM agents exceed the privileges a task requires when selecting skills. It shows that over-privilege is the norm across ten frontier models, and that the models fail under realistic user interactions.
