I gave the same AI 6 different personalities and made them play poker 100 times.

Reddit r/singularity 05/23/26, 01:57 PM News

ai-personalities poker prompt-engineering open-source local-llm behavioral-simulation

Summary

An experiment giving the same 1.2B language model six different personalities and playing 100 poker tournaments reveals drastic behavioral differences: a 'Grinder' never wins but never loses, a 'Tilter' wins big or busts, and a 'Shark' dominates. The results highlight how personality prompts can profoundly shape LLM decision-making.

A few days ago! I made different AI models play poker against each other. This time I wanted to know: if you give the exact same AI 6 different personalities, do they actually play differently? I took a 1.2B language model running locally on my Mac, put it in all 6 seats of a poker table, and gave each seat a different personality a Shark, a Maniac, a Gambler, a Tilter, a Grinder, and a Rock. Same model, same cards, same rules. The only thing that changes is a paragraph of text telling each copy who it is. Then I ran 100 tournaments( Ik it doesn't show anything will need at least 10k tournaments... but even this took quite a few hours!). **The results:** |Personality|Wins|Eliminated|Avg Place| |:-|:-|:-|:-| |Shark (patient, calculating)|45|32%|2.3| |Maniac (fearless, relentless)|24|50%|3.0| |Gambler (optimistic, stubborn)|21|51%|3.6| |Tilter (emotional, revenge-driven)|10|80%|5.1| |Grinder (cautious, methodical)|0|0%|2.7| |Rock (disciplined, conservative)|0|63%|4.3| **The character that fascinated me most was the Grinder( like fr ).** Zero wins. In 100 tournaments. But also zero eliminations it survived every single game. Every time, it finished 2nd or 3rd. Never first, never last.... It was told to : “Survive longer than everyone else by taking minimal risk.” And it did exactly that. It checked and called, never raised, never bluffed, never took a risk. Other players knocked each other out around it. The Grinder just… endured. But surviving isn’t winning. It accumulated zero chips because it never bet enough to win a pot. It obeyed the personality instruction perfectly and that’s exactly why it could never win. **The Tilter was the opposite story.** Told to “never let a bad beat go unanswered,” the Tilter won 10 tournaments but was eliminated in 80 of them. When it won, it won big. When it lost, it spiraled: lose a hand, escalate the next one, lose bigger, go broke. The revenge-driven personality creates a death spiral. Boom or bust, nothing in between. **The Shark just quietly dominated.** 45 wins out of 100 nearly half. Same model as every other player at the table. The only difference was a paragraph that said “patient, calculating, predatory.” It picked its spots, punished the weaker players, and avoided unnecessary risk. The model actually interpreted the nuance between “be aggressive” (Maniac: 24 wins) and “be selectively aggressive” (Shark: 45 wins). **What surprised me:** A paragraph of personality text maybe 50 words created a 45-to-0 win differential between the best and worst personalities. The model is the same. The cards are random. The only variable is *who the AI thinks it is*. This was a 1.2B parameter model. Not GPT-4, not Claude a tiny model running on a laptop. And the personality text wasn’t a suggestion. The Grinder survived because we told it to survive. The Tilter self-destructed because we told it to seek revenge. The Shark won because we told it to be patient. **If you want to try it yourself:** Everything is open source and runs locally: * [Hive](https://github.com/chiruu12/Hive) : the agent framework (`pip install hive-agent`) * [Hive Arena](https://github.com/chiruu12/hive-arena) : the experiment runner with persona profiles * [PokerTable](https://github.com/chiruu12/pokertable) : the poker engine (`pip install pokertable`) The persona profiles are YAML files in the repo. You just need a local model running via LM Studio or Ollama. **TL;DR:** Same AI. Same cards. 6 different personality paragraphs. One never lost but never won. One won nearly half the time. Personality prompts aren’t flavor text they change how the AI plays.

Original Article

I gave the same AI 6 different personalities and made them play poker 100 times.

Similar Articles

I made 6 AI models play poker against each other. The 1.2B model has a gambling problem and it keeps winning.

I Made LLMs Play Texas Hold’em. The Smallest Model Beat a ~1T Model by Being Too Dumb to Fold

How Well Do Large Language Models Capture Human Personality?

Poker Arena: Multi-Axis Profiling of Strategic Reasoning and Memory in LLMs

Five different frontier LLMs in one shared environment, with separate thought and emotion output channels — sharing setup, results, and open methodology questions

Submit Feedback

Similar Articles

I made 6 AI models play poker against each other. The 1.2B model has a gambling problem and it keeps winning.

I Made LLMs Play Texas Hold’em. The Smallest Model Beat a ~1T Model by Being Too Dumb to Fold

How Well Do Large Language Models Capture Human Personality?

Poker Arena: Multi-Axis Profiling of Strategic Reasoning and Memory in LLMs

Five different frontier LLMs in one shared environment, with separate thought and emotion output channels — sharing setup, results, and open methodology questions