breadth-search

Tag

Cards List
#breadth-search

Ko-WideSearch: A Korean Breadth-Search Benchmark for Exhaustive Set Enumeration by Web Agents

arXiv cs.CL · 6d ago Cached

Introduces Ko-WideSearch, a Korean breadth-search benchmark for web agents that evaluates exhaustive set enumeration across 228 tables. Findings show agents have high item recall but struggle with row completion, especially for open-ended cells.

0 favorites 0 likes
← Back to home

Submit Feedback