lean-4

Tag

Cards List
#lean-4

@logic_int: NEW: Aleph Prover has formalized OpenAI’s disproof of Paul Erdős’ planar unit problem. We are releasing the formalizati…

X AI KOLs Following · 2026-05-28 Cached

Aleph Prover has formalized OpenAI's disproof of Paul Erdős' planar unit problem in Lean 4 and released it as open source for independent validation, demonstrating AI's role in accelerating mathematical research with verifiable proof data.

0 favorites 0 likes
#lean-4

Agentic Proving for Program Verification

arXiv cs.AI · 2026-05-25 Cached

This paper evaluates Claude Code in an agentic proving framework on the Clever benchmark for program verification, achieving over 98% success in specification generation and end-to-end verification, revealing that existing benchmarks may be insufficient for evaluating modern agentic provers.

0 favorites 0 likes
#lean-4

OProver: A Unified Framework for Agentic Formal Theorem Proving

Hugging Face Daily Papers · 2026-05-17 Cached

OProver is a unified framework for agentic formal theorem proving in Lean 4 that iteratively improves proof generation through training with verified proofs and compiler feedback, achieving state-of-the-art results on multiple benchmarks.

0 favorites 0 likes
#lean-4

@AnimaAnandkumar: TorchLean codebase is now available! TorchLean is a Lean 4 framework for verified neural-network software. It supports …

X AI KOLs Following · 2026-05-11 Cached

TorchLean is a newly released Lean 4 framework that enables formal verification of neural network software, featuring typed tensors, verified autograd, PyTorch interoperability, and GPU execution. The release expands support to modern architectures like diffusion models, GPT-style transformers, and state-space models, bridging practical ML workflows with mathematical proof checking.

0 favorites 0 likes
#lean-4

Discover and Prove: An Open-source Agentic Framework for Hard Mode Automated Theorem Proving in Lean 4

arXiv cs.CL · 2026-04-20 Cached

This paper introduces Discover and Prove (DAP), an open-source agentic framework for automated theorem proving in Lean 4 that tackles 'Hard Mode' problems where the answer must be discovered independently before formal proof construction. The work releases new Hard Mode benchmark variants and achieves state-of-the-art results while revealing a significant gap between LLM answer accuracy (>80%) and formal prover success (<10%).

0 favorites 0 likes
← Back to home

Submit Feedback