black-box-transfer

LLM-Agnostic Semantic Representation Attack

arXiv cs.CL · 2026-05-12

This paper introduces Semantic Representation Attack (SRA), a novel LLM-agnostic method that optimizes for malicious semantic representations rather than exact text, achieving high attack success rates across multiple open-source models.
