black-box-transfer

LLM-Agnostic Semantic Representation Attack

arXiv cs.CL · 2026-05-12

This paper introduces Semantic Representation Attack (SRA), a novel LLM-agnostic method that optimizes for malicious semantic representations rather than exact text, achieving high attack success rates across multiple open-source models.
