NeurIPS used uncalibrated AI detector for desk rejections [D]
Summary
A submission was desk-rejected from NeurIPS based on an uncalibrated AI detector (Pangram), raising concerns about circularity in the review process and unvalidated false-positive rates on the target distribution.
Similar Articles
Top AI conference uses AI detector to reject papers for allegedly being written by AI
NeurIPS 2026 used a proprietary AI-text detector to desk-reject papers for alleged AI policy violations without validating it on the target distribution; the same detector later flagged conference chairs' own papers as likely AI-written.
Base Models Look Human To AI Detectors
This paper reveals that commercial AI detectors like GPTZero and Pangram judge text from base language models as overwhelmingly human, while instruction-tuned model outputs are flagged as AI-generated. The authors propose HIP, a detector-agnostic iterative paraphrasing pipeline that improves human-likeness while preserving semantics.
Sem-Detect: Semantic Level Detection of AI Generated Peer-Reviews
Sem-Detect introduces a method to distinguish AI-generated peer reviews from human-written ones by combining textual features with claim-level semantic analysis. It achieves a 25.5% improvement in true positive rate at 0.1% false positive rate over baselines, and shows that LLM-refined human reviews retain distinct semantic signals, with fewer than 3.5% misclassified as AI-generated.
Here's an AI Bullshit Detector: I use it daily and it catches things you won't see on your own
A tool called Lighthouse, built by an AI governance engineer, uses runtime validation to detect epistemic drift and confident-sounding nonsense in AI output and writing.
Micro-Defects Expose Macro-Fakes: Detecting AI-Generated Images via Local Distributional Shifts
A local distribution-aware detection framework that amplifies micro-scale statistical irregularities to identify AI-generated images with improved accuracy, outperforming baseline detectors across benchmarks.