skill-layer

Tag

Cards List
#skill-layer

FORTIS: Benchmarking Over-Privilege in Agent Skills

Hugging Face Daily Papers · 2026-05-09 Cached

FORTIS benchmarks how LLM agents frequently exceed necessary privileges when selecting skills, showing over-privilege is the norm across ten frontier models and failing under realistic user interactions.

0 favorites 0 likes
← Back to home

Submit Feedback