step-level-verification

Tag

Cards List
#step-level-verification

Evaluating Research-Level Math Proofs via Strict Step-Level Verification

arXiv cs.AI · 4d ago Cached

This paper introduces a strict step-level verification framework for evaluating research-level mathematical proofs using LLMs, addressing context poisoning and outperforming global evaluation. The approach shifts focus to deductive constraints and reveals that remaining errors are often due to pedantic hyper-rigor, exposing implicit ambiguities in benchmarks.

0 favorites 0 likes
← Back to home

Submit Feedback