program-verification

Tag

Cards List
#program-verification

Agentic Proving for Program Verification

arXiv cs.AI · 2026-05-25 Cached

This paper evaluates Claude Code in an agentic proving framework on the Clever benchmark for program verification, achieving over 98% success in specification generation and end-to-end verification, revealing that existing benchmarks may be insufficient for evaluating modern agentic provers.

0 favorites 0 likes
← Back to home

Submit Feedback