safety-risks

On Safety Risks in Experience-Driven Self-Evolving Agents

arXiv cs.CL · 2026-04-21

Researchers from Harbin Institute of Technology and Singapore Management University investigate safety risks in experience-driven self-evolving LLM agents. They find that even benign task experience can compromise safety in high-risk scenarios because of agents' execution-oriented tendencies, revealing a fundamental safety–utility trade-off.
