gdpval

Measuring the performance of our models on real-world tasks

OpenAI Blog · 2025-09-25

OpenAI introduces GDPval, an evaluation that measures AI model performance on economically valuable, real-world tasks across 44 occupations in the 9 industries that contribute most to US GDP. The benchmark comprises 1,320 specialized tasks based on actual professional work products, marking a move from academic benchmarks toward more realistic occupational assessments.
