guard-models

Tag

Cards List
#guard-models

BraveGuard: From Open-World Threats to Safer Computer-Use Agents

Hugging Face Daily Papers · 2026-06-02 Cached

BraveGuard is a self-evolving defense framework that trains guard models using open-world threat signals and realistic agent trajectories to improve safety detection in computer-use agents, achieving significant accuracy gains on the AgentHazard benchmark.

0 favorites 0 likes
← Back to home

Submit Feedback