balatro

Tag

Cards List
#balatro

Evalatro: an open benchmark where LLMs play the real Balatro

Reddit r/LocalLLaMA · 4d ago

Evalatro is an open benchmark where LLMs play the real game Balatro via a text-based interface, with fixed seeds, a public leaderboard, and the goal of clearing Ante 12. Early results show models struggle, with none reaching the target.

0 favorites 0 likes
← Back to home

Submit Feedback