ai-text-detection

#ai-text-detection

Amplifying, Not Learning: Fine-Tuned AI Text Detectors Amplify a Pretrained Direction

arXiv cs.LG · 2026-05-22

This paper demonstrates that fine-tuned AI text detectors amplify a pretrained typicality axis rather than learning an AI-vs-human boundary, with raw encoder projections often matching or exceeding fine-tuned performance.


#ai-text-detection

Paraphrasing Attack Resilience of Various AI-Generated Text Detection Methods

arXiv cs.LG · 2026-05-15

This paper investigates how well AI-generated text detection methods (fine-tuned RoBERTa, Binoculars, text feature analysis, and ensembles of these) withstand paraphrasing attacks. Ensembles that include Binoculars are the most effective detectors but also the most vulnerable to attack, which points to a trade-off between performance and resilience.


#ai-text-detection

MELD: Multi-Task Equilibrated Learning Detector for AI-Generated Text

arXiv cs.CL · 2026-05-11

This paper introduces MELD, a detector for AI-generated text that uses multi-task learning to improve robustness, with auxiliary heads that predict generator family, attack type, and source domain. MELD achieves strong performance on the RAID benchmark and maintains low false-positive rates under adversarial attacks.

