watermarking

#watermarking

SLAM: Structural Linguistic Activation Marking for Language Models

arXiv cs.CL · 2d ago

SLAM is a white-box watermarking scheme that uses sparse autoencoders to embed marks in the structural geometry of LLM residual streams. The paper reports 100% detection accuracy with minimal quality loss on Gemma-2 models, while avoiding the token-distribution biasing of prior methods.

Reversing SynthID

Lobsters Hottest · 2026-04-23

A security researcher details how Google’s SynthID invisible watermark for AI-generated images can be reversed, which undermines media-provenance claims and points to fundamental flaws in proprietary watermarking schemes.

Protecting Language Models Against Unauthorized Distillation through Trace Rewriting

arXiv cs.CL · 2026-04-20

This paper proposes protecting large language models against unauthorized knowledge distillation in two ways: rewriting reasoning traces so that they are less useful for training while remaining correct, and embedding verifiable watermarks in distilled student models. It uses instruction-based and gradient-based rewriting techniques to achieve anti-distillation effects without compromising teacher model performance.

A Linguistics-Aware LLM Watermarking via Syntactic Predictability

arXiv cs.CL · 2026-04-20

This paper introduces STELA, a linguistics-aware watermarking framework for LLMs that uses syntactic predictability via POS n-grams to balance text quality and detection robustness. The method enables publicly verifiable watermark detection without access to model logits, and the authors report superior performance across typologically diverse languages (English, Chinese, Korean).

@GoogleDeepMind: More natural sounding speech Support for 70+ languages like Hindi, Japanese, and German SynthID watermarking on all out…

X AI KOLs · 2026-04-15

Google DeepMind upgraded its speech synthesis model to sound more natural across 70+ languages and now applies SynthID watermarking to all outputs.

SynthID Detector — a new portal to help identify AI-generated content

Google DeepMind Blog · 2025-05-20

Google announced SynthID Detector, a verification portal that identifies AI-generated content across images, audio, video, and text by detecting imperceptible SynthID watermarks embedded in media created with Google's AI tools. The platform is rolling out to early testers with plans for broader availability to journalists, media professionals, and researchers.

Understanding the source of what we see and hear online

OpenAI Blog · 2024-05-07

OpenAI announces tools and research efforts to help verify content authenticity, including text watermarking, metadata approaches, and expanded image detection with C2PA metadata integration for tracking AI-generated and edited content.
