Agent-based models for the evolution of morphological alternation patterns

arXiv cs.CL Papers

Summary

This paper presents multi-agent simulations of the emergence of morphological alternation patterns (like 'go/went') in language, using an AI Historical Linguist (LLM-driven) to evaluate plausibility of evolved morphologies against real languages.

arXiv:2606.12748v1 Announce Type: new Abstract: Why is the past of English "go" the apparently unrelated "went"? Such alternations are frequent in languages. They neither aid communication nor learnability, yet they can be persistent, surviving over centuries or millennia. We present a multi-agent simulation of the emergence of morphological stem and inflection alternations. Alternate forms arise by phonological changes or, as with "go/went", from lexical alternatives associated with a subset of the population. When an agent 'hears' another agent use a novel form for a slot in the paradigm of a word (say, the past tense of go), they will with some probability adopt that form, possibly spreading its use to other slots in the paradigm that shared the same original form. Thus alternative forms can spread through the population and become entrenched as stem or inflectional marker alternants. Unlike many previous computational studies, our system allows for naturalistic lexical forms, realistic phonological rules, lexicons with hundreds or thousands of entries, and agent populations in the tens or hundreds. It supports several network topologies, diffusion patterns and agent adoption policies. One issue with such simulations is evaluation: how realistic is the resulting morphology compared to those of real languages? We introduce the AI Historical Linguist, a novel Large Language Model-driven system that models a debate between two historical linguists. We use this to compare a set of real language morphologies, disguised morphologies, and experimentally evolved morphologies. The results suggest that among the factors that favor more plausible morphologies are scale-free social networks and random Bernoulli adoption of forms. We also present three case studies modeling attested historical changes, allowing us to test what might have happened if history had been different. All code and data are released.
Original Article
View Cached Full Text

Cached at: 06/12/26, 08:50 AM

# Agent-based models for the evolution of morphological alternation patterns
Source: [https://arxiv.org/abs/2606.12748](https://arxiv.org/abs/2606.12748)
[View PDF](https://arxiv.org/pdf/2606.12748)

> Abstract:Why is the past of English "go" the apparently unrelated "went"? Such alternations are frequent in languages\. They neither aid communication nor learnability, yet they can be persistent, surviving over centuries or millennia\. We present a multi\-agent simulation of the emergence of morphological stem and inflection alternations\. Alternate forms arise by phonological changes or, as with "go/went", from lexical alternatives associated with a subset of the population\. When an agent 'hears' another agent use a novel form for a slot in the paradigm of a word \(say, the past tense of go\), they will with some probability adopt that form, possibly spreading its use to other slots in the paradigm that shared the same original form\. Thus alternative forms can spread through the population and become entrenched as stem or inflectional marker alternants\. Unlike many previous computational studies, our system allows for naturalistic lexical forms, realistic phonological rules, lexicons with hundreds or thousands of entries, and agent populations in the tens or hundreds\. It supports several network topologies, diffusion patterns and agent adoption policies\. One issue with such simulations is evaluation: how realistic is the resulting morphology compared to those of real languages? We introduce the AI Historical Linguist, a novel Large Language Model\-driven system that models a debate between two historical linguists\. We use this to compare a set of real language morphologies, disguised morphologies, and experimentally evolved morphologies\. The results suggest that among the factors that favor more plausible morphologies are scale\-free social networks and random Bernoulli adoption of forms\. We also present three case studies modeling attested historical changes, allowing us to test what might have happened if history had been different\. All code and data are released\.

## Submission history

From: Richard Sproat \[[view email](https://arxiv.org/show-email/3a297006/2606.12748)\] **\[v1\]**Wed, 10 Jun 2026 23:26:44 UTC \(5,283 KB\)

Similar Articles

HawkesLLM: Semantic Uncertainty Propagation in Agentic Text Simulation

arXiv cs.CL

This paper introduces HawkesLLM, a framework that models semantic uncertainty propagation in multi-step agentic text simulations by combining a multivariate Hawkes process for temporal influence and memory selection with a language model for text generation. Evaluation on a GDELT news-cascade case study shows improved late-stage semantic alignment under compact prompt-memory constraints.

Emergence of grounded compositional language in multi-agent populations

OpenAI Blog

OpenAI researchers propose a multi-agent learning environment where agents develop a grounded compositional language with defined vocabulary and syntax to achieve goals. The study also observes emergence of non-verbal communication like pointing when language is unavailable.