Tag
Mistral AI releases Leanstral 1.5, a 6B active parameter model for Lean 4 proof engineering, achieving state-of-the-art results on multiple formal verification benchmarks and uncovering real-world bugs, fully open-sourced under Apache-2.0.
This paper presents a case study in reformalization, transferring the Jordan Curve Theorem between proof assistants (Mizar to Lean, HOL Light to Lean and Agda) using LLMs, and analyzes pipeline design choices for practical reformalization.
A detailed blog post presenting a fully commented proof of the Fundamental Theorem of Arithmetic in Agda, intended for intermediate learners of the proof assistant.
This paper applies the VGPT-RSI AI system to produce formally verified partial results related to the Riemann Hypothesis, including boundary certificates and finite Lagarias inequalities, while explicitly identifying remaining mathematical obstructions.
hax is a tool for translating Rust code into formal languages like F*, Rocq, and Lean for high-assurance verification.
Google new paper proposes the LEAP framework, which decomposes math problems into goal trees, learns from Lean verifier feedback, and improves LLM accuracy on math competition problems from 10% to 70%. It solves all 12 problems of Putnam 2025 and surpasses dedicated gold-medal-level systems on IMO-style benchmarks.
This paper describes a formally verified library of mathematical finance in Lean 4, containing over 200 theorems covering measure-theoretic foundations through derivative pricing, and includes a faithfulness audit to classify results by how their Lean statement relates to the claimed mathematics.