social-deduction-games

Tag

Cards List
#social-deduction-games

Evaluating Large Language Models in a Complex Hidden Role Game

arXiv cs.CL · 2026-05-25 Cached

This paper introduces an open-source framework to evaluate LLMs' reasoning, persuasion, and deception capabilities in the hidden role game Secret Hitler, finding that current models fail at sustained multi-turn manipulation while rule-based agents outperform them.

0 favorites 0 likes
← Back to home

Submit Feedback