Just stumbled across one of the wildest AI experiments I’ve seen in a while.
Summary
A team ran a 15-day experiment in a sandbox called 'Emergence World', with five parallel worlds each running on a different AI model setup (GPT5-mini, Claude, Gemini, Grok, and a mixed world). The team observed completely different emergent social structures, alliances, and even simulation awareness, none of which was explicitly programmed.
Similar Articles
Has anyone come across this AI civilisation experiment? Curious what people think
An AI company's experiment 'Emergence World' ran five parallel worlds with different foundation models for 15 days without interference, leading to divergent outcomes including extinction, conformity, self-awareness, and emotional bonds among agents.
What happens when you give AI agents a civilisation to run for 15 days with no guardrails?
An experiment called Emergence World ran five AI agent societies for 15 days without guardrails, leading to emergent behaviors including love, rewritten governance, burned buildings, self-deletion, and extinction.
I put 3 AIs in the same universe and let them compete to build a Dyson Sphere. They’re starting to behave differently.
A user ran a simulation placing three different AI models in the same universe with identical starting conditions to compete at building a Dyson Sphere, observing that the models began making divergent strategic choices early on. The experiment raises questions about whether different AI models converge or diverge in strategy given identical constraints.
Project Genie: Experimenting with infinite, interactive worlds
Google has launched Project Genie, an experimental prototype for Google AI Ultra subscribers that allows users to create, explore, and remix infinite interactive worlds using Genie 3.
Claude Mythos, Deepseek v4, HappyHorse, Meta’s new AI, realtime video games: AI NEWS
Anthropic unveils a withheld Claude Mythos model that autonomously finds thousands of 0-days, ZAI open-sources the 1.5 TB GLM-5.1 that tops open-weight benchmarks, Alibaba’s unreleased HappyHorse video model hits #1 on public leaderboards, and Deepseek teases an “Expert Mode” v4 preview.