research-level

#research-level

Humans outperform AI at this highly rigorous mathematics test

Reddit r/singularity ↗ · 3d ago Cached

The First Proof test evaluated four AI systems on novel research-level math problems, with the top model scoring only 6 out of 10, demonstrating that current AI still lags behind top mathematicians in rigorous reasoning.

0 favorites 0 likes

research-level

Humans outperform AI at this highly rigorous mathematics test

Submit Feedback