Tag
Introduces K-BrowseComp, a Korean web-browsing agent benchmark with 400 problems, revealing substantial performance gaps compared to English benchmarks and underscoring the need for robust Korean AI development.