browsing-agents

Tag

Cards List
#browsing-agents

BrowseComp: a benchmark for browsing agents

OpenAI Blog · 2025-04-10 Cached

OpenAI released BrowseComp, a benchmark of 1,266 challenging problems designed to measure AI agents' ability to locate hard-to-find information across the internet, available in their simple evals GitHub repository.

0 favorites 0 likes
← Back to home

Submit Feedback