GPT 5.5 Cannot Do These Puzzles

Reddit r/singularity 05/14/26, 01:21 AM News

gpt-5 puzzles reasoning limitations ai-testing jane-street

Summary

GPT 5.5 fails to solve a Jane Street puzzle, as well as the previous month's puzzle, suggesting continued limitations in AI reasoning.

[Jane Street Puzzles](https://preview.redd.it/lrrv2kgj801h1.png?width=864&format=png&auto=webp&s=2866307b063b7374de00da40e3f0db2c60d7cf21) Can any of you get it to find the solution? I used GPT 5.5 with extended thinking and xhigh. Maybe Pro can do it. It can't do last month's problem either.
