model-progress

Tag

Cards List
#model-progress

@OpenAI: Let’s talk about evals. We’re always looking for better ways to measure and forecast model progress, especially as benc…

X AI KOLs · 8h ago Cached

OpenAI discusses the importance of evals (evaluations) for measuring and forecasting model progress, especially as benchmarks become saturated or gamed, featuring insights from Tejal Patwardhan and Andrew Mayne.

0 favorites 0 likes
← Back to home

Submit Feedback