social-deduction-games

#social-deduction-games

Evaluating Large Language Models in a Complex Hidden Role Game

arXiv cs.CL ↗ · 2026-05-25 Cached

This paper introduces an open-source framework to evaluate LLMs' reasoning, persuasion, and deception capabilities in the hidden role game Secret Hitler, finding that current models fail at sustained multi-turn manipulation while rule-based agents outperform them.

0 favorites 0 likes

social-deduction-games

Evaluating Large Language Models in a Complex Hidden Role Game

Submit Feedback