Humanity's Last Exam: current benchmarks, thoughts?

Reddit r/singularity News

Summary

Discussion of recent AI model scores on the Humanity's Last Exam benchmark, noting the improvement from 2.7% for GPT-4o (May 2024) to around 45% for recent models as of June 2026, and asking whether the exam is really that hard.

So, some of the recent models have scored around 45 percent on that exam, as of June 2026. GPT-4o, from May 2024, scored 2.7 percent. To me, this seems like good progress. But I wanted to ask: is the exam really that hard?