Tag
This paper introduces the CIFAR Synthetic Evidence Corpus, a dataset designed for detecting AI-generated evidence in legal contexts. It spans multiple document types and manipulation strategies, includes structured metadata, and provides a benchmark suite for evaluating detection systems.
User expresses confusion over distinguishing real from AI-generated video, highlighting the growing realism of synthetic media.
A large-scale empirical study analyzes 284 linguistic features across 27 LLMs and 10 text domains to assess which features reliably detect AI-generated text. The study finds that lexical richness measures are the most robust cross-domain and cross-model signals, while many other proposed indicators are strongly context-dependent.
This paper investigates whether open-source quantized LLMs encode a linearly separable truthfulness signal in their hidden states. Across three 7B-8B instruction-tuned models, a linear probe on a single mid-network layer achieves 0.904-1.000 AUROC on hallucination detection benchmarks, outperforming sampling-based methods.
The article explores the current state of AI-generated writing, its detection, and the implications for education and literature, referencing the Granta controversy where a story suspected to be AI-written won a prize.
Introduces LUNA, a linguistics-aware LLM watermarking method that achieves non-distortionary embedding and model-free detection across multiple languages, significantly improving AUROC and perplexity preservation.
Introduces SynCred-Bench, a benchmark of 600 AI-generated misinformation images across six credible-form categories, showing that existing detectors (including MLLMs, open-source AIGC detectors, and commercial APIs) perform poorly, with human annotators also struggling.
A user describes how they've developed an intuition for detecting ChatGPT's writing style, noting patterns that persist even after editing, and confirms this using the Lynote AI detector.
NVIDIA's LocateAnything, a vision-language detection model rethinking bounding box prediction, is now available as a Hugging Face Space and trending #1 on the platform. The space template was created by @_akhaliq.
NVIDIA's research team released LocateAnything, a vision-language detection model that rethinks bounding box prediction, which is trending #1 on HuggingFace.
Introduces TADDLE, a tool-augmented agent for detecting deficient LLM-generated peer reviews, along with an expert-annotated benchmark of 1,800 reviews on 50 ICLR 2025 papers. The system decomposes detection into four specialized analysis tools and uses two-stage semi-supervised learning for binary and multi-label classification.
This paper reveals the existence of hidden human-like spans in machine-generated texts and proposes a model-agnostic stacked enhancement framework that improves existing detectors by reducing the influence of these spans.
A user points out that Substack is flooded with obvious AI-generated content that people praise because they are too anti-AI to recognize AI prose.
An article discussing the controversy over a prize-winning short story that was accused of being generated by AI, and the broader implications for authorship and detection in the age of large language models.
This paper presents findings from the Counter Turing Test shared task on AI-generated text detection, with top systems achieving perfect binary classification but significantly lower performance in model attribution, highlighting the difficulty of distinguishing outputs from different large language models.
A computational framework combining prompt-based filtering and unsupervised clustering to identify manipulative political narrative clusters from social media posts without predefined categories.
OpenAI introduces Daybreak, a tool to automate security detection, validation, and response.
OpenAI releases GPT-2 1.5B model with analysis of human perception of credibility, potential for misuse through fine-tuning on extremist ideologies, and challenges in detecting synthetic text. Detection models achieve ~95% accuracy but require complementary approaches for practical deployment.
roboflow/supervision is an open-source Python toolkit for computer vision that provides reusable building blocks for data loading, annotation, and real-time processing, with model-agnostic support for popular libraries.