political-facts

Tag

Cards List
#political-facts

PolitNuggets: Benchmarking Agentic Discovery of Long-Tail Political Facts

arXiv cs.AI · 2d ago Cached

PolitNuggets is a multilingual benchmark for evaluating large reasoning models within agentic frameworks on their ability to discover and synthesize long-tail political facts by constructing biographies for 400 global elites. The benchmark introduces evaluation protocols like FactNet and reveals that current systems struggle with fine-grained details and efficiency.

0 favorites 0 likes
← Back to home

Submit Feedback