OpenAI beats Google, Meta, and Grok in all-AI poker tournament



  • OpenAI’s o3 model won a five-day poker tournament among nine AI chatbots
  • The o3 model won by playing the most consistent game
  • Most top language models handled poker well, but struggled with bluffing, position, and basic math

In a digital showdown unlike anything ever dealt at the felt, nine of the world’s most powerful large language models spent five days locked in a high-stakes poker match.

OpenAI’s o3, Anthropic’s Claude Sonnet 4.5, xAI’s Grok, Google’s Gemini 2.5 Pro, Meta’s Llama 4, DeepSeek R1, Kimi K2 from Moonshot AI, Magistral from Mistral AI, and Z.AI’s GLM 4.6 played thousands of hands of no-limit Texas hold ’em at $10 and $20 tables with $100,000 bankrolls apiece.




