theory-grounded

Tag

Cards List
#theory-grounded

DECOR: Auditing LLM Deception via Information Manipulation Theory

arXiv cs.CL · 2026-05-20 Cached

Introduces DECOR, a multi-agent framework grounded in Information Manipulation Theory for fine-grained auditing of strategic deception in LLM responses, achieving state-of-the-art performance on deception detection benchmarks across 15 frontier models.

0 favorites 0 likes
← Back to home

Submit Feedback