IF LLMS HAVE HUMAN-LIKE ATTRIBUTES, THEN SO DOES Age of Empires II
Summary
This paper argues that attributing human-like attributes to large language models is problematic because similar claims could be made about simpler systems, such as an AI trained on Age of Empires II, and proposes a null assumption of non-uniqueness to avoid circular reasoning.
View Cached Full Text
Cached at: 06/18/26, 03:43 PM
# If LLMs Have Human-Like Attributes, Then So Does Age of Empires II Source: [https://arxiv.org/abs/2605.31514](https://arxiv.org/abs/2605.31514) ## Computer Science \> Computation and Language **arXiv:2605\.31514**\(cs\) \[Submitted on 29 May 2026 \([v1](https://arxiv.org/abs/2605.31514v1)\), last revised 11 Jun 2026 \(this version, v3\)\] [View PDF](https://arxiv.org/pdf/2605.31514) > Abstract:Much research has been carried out on large language models \(LLMs\) and LLM\-powered agentic workflows\. However, many works within the field state emergence of, ascribe to, or assume, generalised anthropomorphic attributes to them \(e\.g\., morality or understanding of natural language\)\. Our goal is not to argue in favour or against the existence of these attributes, but to point out that these conclusions could be incorrect\. For this we build and train a simple neural network on the videogame Age of Empires II, and note that any entity in a sufficiently\-powerful substrate, such as LEGO or the Greater Boston Area, could also present such attributes\. Hence, the purported anthropomorphic attributes of LLMs are empirically non\-unique: although some properties \(e\.g\., responses to prompts\) could remain invariant, others, such as the interpretation of their perceived behaviour, might change with the substrate\. Thus, any empirically\-grounded discussion on these attributes requires explicit measurement criteria; otherwise the interpretation is left to the representation\. We then show that assuming that these attributes exist or not in a system, independent of the substrate and in a generalised way, leads to either circular or uninformative conclusions\. This is regardless of the experimenter's viewpoint on the subject, or whether the outcome shows existence or non\-existence\. Finally we propose a 'null' assumption, where one assumes LLM non\-uniqueness instead of assuming anthropomorphic attributes to set up an experiment, along with examples of it\. We also discuss potential objections to our work, briefly survey the field, and prove that Age of Empires II is functionally\- and Turing\-complete\. ## Submission history From: Adrian de Wynter \[[view email](https://arxiv.org/show-email/6fdfecaa/2605.31514)\] **[\[v1\]](https://arxiv.org/abs/2605.31514v1)**Fri, 29 May 2026 16:31:31 UTC \(13,704 KB\) **[\[v2\]](https://arxiv.org/abs/2605.31514v2)**Mon, 1 Jun 2026 21:31:22 UTC \(13,705 KB\) **\[v3\]**Thu, 11 Jun 2026 16:55:07 UTC \(13,701 KB\) Bibliographic Tools ## Bibliographic and Citation Tools Bibliographic Explorer Toggle Code, Data, Media ## Code, Data and Media Associated with this Article Demos ## Demos Related Papers ## Recommenders and Search Tools About arXivLabs ## arXivLabs: experimental projects with community collaborators arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website\. Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy\. arXiv is committed to these values and only works with partners that adhere to them\. Have an idea for a project that will add value for arXiv's community?[**Learn more about arXivLabs**](https://info.arxiv.org/labs/index.html)\.
Similar Articles
@MilesCranmer: This is an insane paper and I love it https://arxiv.org/abs/2605.31514
This paper argues that anthropomorphic attributes often ascribed to LLMs are not unique, demonstrating that simpler systems like Age of Empires II can exhibit similar perceived traits, and calls for explicit measurement criteria in AI behavior analysis.
@rohanpaul_ai: New Microsoft + York Univ paper argues that LLMs should not be treated as human-like without clear tests and narrower c…
A Microsoft and York University paper argues that attributing human-like attributes to LLMs is problematic due to flawed experimental designs, using Age of Empires II as an analogy to highlight measurement issues.
Persona-Assigned Large Language Models Exhibit Human-Like Motivated Reasoning
This paper investigates whether assigning personas to large language models induces human-like motivated reasoning, finding that persona-assigned LLMs show up to 9% reduced veracity discernment and are up to 90% more likely to evaluate scientific evidence in ways congruent with their induced political identity, with prompt-based debiasing largely ineffective.
How Well Do Large Language Models Capture Human Personality?
This paper systematically evaluates assumptions about LLM persona prompting and identifies 'persona manifold collapse,' where richer persona descriptions reduce behavioral diversity and simulation fidelity. The findings show that simple age-gender personas often outperform more detailed profiles.
## Language adaption as language models become integral part of society.
The article argues that as LLM-based AI becomes ubiquitous, language should adapt by creating new pronouns for AI, since neither human pronouns ('he/she') nor impersonal 'it' accurately reflect the unique relationship with language-capable non-human entities.