GPT 5.5 Cannot Do These Puzzles
Summary
GPT 5.5 fails to solve Jane Street Puzzles that its predecessor could not handle either, suggesting continued limitations in AI reasoning.
Similar Articles
GPT-5.5 was used to flag fatal errors in FrontierMath problems
GPT-5.5 was used by Epoch to identify fatal errors in approximately one-third of the FrontierMath benchmark problems, demonstrating the model's capability to sanity-check evaluation standards.
On SWEBench Pro, 68.5% of GPT 5.5’s failures were caused by broken or incorrect test cases, totaling 28.9% of the entire benchmark
An analysis reveals that 28.9% of GPT 5.5's failures on SWEBench Pro are due to broken or incorrect test cases, and similar issues affect other major AI benchmarks, raising concerns about the accuracy of current evaluation methods.
GPT-5.4 Thinking System Card
OpenAI releases GPT-5.4 Thinking, the latest reasoning model in the GPT-5 series with enhanced safety mitigations, notably the first general-purpose model implementing comprehensive cybersecurity safeguards.
Puzzled By ChatGPT? No more! A Jigsaw Puzzle to Promote AI Literacy and Awareness
This paper introduces a jigsaw puzzle designed with comic-based infographics to promote AI literacy, explaining the workings, capabilities, limitations, and societal implications of generative AI like ChatGPT in an engaging, hands-on format.
Fields Medal winning mathematician Timothy Gowers used GPT5.5 Pro to solve open problems, believes mathematical research will face a ‘crisis’ very soon with current rate of progress
Fields Medalist Timothy Gowers reports using GPT5.5 Pro to solve open mathematical problems and predicts an imminent crisis in mathematical research due to rapid AI progress.