paraphrasing-attack

#paraphrasing-attack

Paraphrasing Attack Resilience of Various AI-Generated Text Detection Methods

arXiv cs.LG ↗ · 2026-05-15 Cached

This paper investigates the resilience of AI-generated text detection methods (fine-tuned RoBERTa, Binoculars, text feature analysis, and ensembles) against paraphrasing attacks, finding that Binoculars-inclusive ensembles are most effective but also most vulnerable to attacks, highlighting a dichotomy between performance and resilience.

0 favorites 0 likes

paraphrasing-attack

Paraphrasing Attack Resilience of Various AI-Generated Text Detection Methods

Submit Feedback