Agent-based models for the evolution of morphological alternation patterns

arXiv cs.CL 06/12/26, 04:00 AM Papers

agent-based-models morphological-alternation language-evolution multi-agent-simulation computational-linguistics large-language-model historical-linguistics

Summary

This paper presents multi-agent simulations of the emergence of morphological alternation patterns (like 'go/went') in language, using an AI Historical Linguist (LLM-driven) to evaluate plausibility of evolved morphologies against real languages.

arXiv:2606.12748v1 Announce Type: new Abstract: Why is the past of English "go" the apparently unrelated "went"? Such alternations are frequent in languages. They neither aid communication nor learnability, yet they can be persistent, surviving over centuries or millennia. We present a multi-agent simulation of the emergence of morphological stem and inflection alternations. Alternate forms arise by phonological changes or, as with "go/went", from lexical alternatives associated with a subset of the population. When an agent 'hears' another agent use a novel form for a slot in the paradigm of a word (say, the past tense of go), they will with some probability adopt that form, possibly spreading its use to other slots in the paradigm that shared the same original form. Thus alternative forms can spread through the population and become entrenched as stem or inflectional marker alternants. Unlike many previous computational studies, our system allows for naturalistic lexical forms, realistic phonological rules, lexicons with hundreds or thousands of entries, and agent populations in the tens or hundreds. It supports several network topologies, diffusion patterns and agent adoption policies. One issue with such simulations is evaluation: how realistic is the resulting morphology compared to those of real languages? We introduce the AI Historical Linguist, a novel Large Language Model-driven system that models a debate between two historical linguists. We use this to compare a set of real language morphologies, disguised morphologies, and experimentally evolved morphologies. The results suggest that among the factors that favor more plausible morphologies are scale-free social networks and random Bernoulli adoption of forms. We also present three case studies modeling attested historical changes, allowing us to test what might have happened if history had been different. All code and data are released.

Original Article

View Cached Full Text

Cached at: 06/12/26, 08:50 AM

# Agent-based models for the evolution of morphological alternation patterns
Source: [https://arxiv.org/abs/2606.12748](https://arxiv.org/abs/2606.12748)
[View PDF](https://arxiv.org/pdf/2606.12748)

> Abstract:Why is the past of English "go" the apparently unrelated "went"? Such alternations are frequent in languages\. They neither aid communication nor learnability, yet they can be persistent, surviving over centuries or millennia\. We present a multi\-agent simulation of the emergence of morphological stem and inflection alternations\. Alternate forms arise by phonological changes or, as with "go/went", from lexical alternatives associated with a subset of the population\. When an agent 'hears' another agent use a novel form for a slot in the paradigm of a word \(say, the past tense of go\), they will with some probability adopt that form, possibly spreading its use to other slots in the paradigm that shared the same original form\. Thus alternative forms can spread through the population and become entrenched as stem or inflectional marker alternants\. Unlike many previous computational studies, our system allows for naturalistic lexical forms, realistic phonological rules, lexicons with hundreds or thousands of entries, and agent populations in the tens or hundreds\. It supports several network topologies, diffusion patterns and agent adoption policies\. One issue with such simulations is evaluation: how realistic is the resulting morphology compared to those of real languages? We introduce the AI Historical Linguist, a novel Large Language Model\-driven system that models a debate between two historical linguists\. We use this to compare a set of real language morphologies, disguised morphologies, and experimentally evolved morphologies\. The results suggest that among the factors that favor more plausible morphologies are scale\-free social networks and random Bernoulli adoption of forms\. We also present three case studies modeling attested historical changes, allowing us to test what might have happened if history had been different\. All code and data are released\.

## Submission history

From: Richard Sproat \[[view email](https://arxiv.org/show-email/3a297006/2606.12748)\] **\[v1\]**Wed, 10 Jun 2026 23:26:44 UTC \(5,283 KB\)

Agent-based models for the evolution of morphological alternation patterns

Similar Articles

Emergent Languages in Populations of Language Model Agents: From Token Efficiency to Oversight Evasion

HawkesLLM: Semantic Uncertainty Propagation in Agentic Text Simulation

Auto-regressive LLMs are officially sleeping with the fishes (Yann LeCun was right)

Emergence of grounded compositional language in multi-agent populations

PACE: Two-Timescale Self-Evolution for Small Language Model Agents

Submit Feedback

Similar Articles

Emergent Languages in Populations of Language Model Agents: From Token Efficiency to Oversight Evasion

HawkesLLM: Semantic Uncertainty Propagation in Agentic Text Simulation

Auto-regressive LLMs are officially sleeping with the fishes (Yann LeCun was right)

Emergence of grounded compositional language in multi-agent populations

PACE: Two-Timescale Self-Evolution for Small Language Model Agents