mathematical-software-engineering


Sorries Are Not the Hard Part: An Expert-Review Case Study of a Semi-Autonomous Formalization

arXiv cs.AI · 2d ago

This paper presents a case study of using an LLM-based coding agent (Claude Code) to formalize Grothendieck's vanishing theorem in the Lean theorem prover. It finds that although agents can produce proofs that Lean accepts, they struggle with definitions and API design, which is why expert review is needed beyond checking that the code compiles.
