lean

Tag

Cards List
#lean

@rohanpaul_ai: Google DeepMind's new paper. Shows that AI can now search formal mathematics proofs, but only inside carefully constrai…

X AI KOLs Following · 2026-05-22 Cached

Google DeepMind's new paper introduces AlphaProof Nexus, an AI system that combines an LLM with the Lean proof checker to search for formal proofs in constrained mathematical domains. The system solves several unsolved problems from the Erdős and OEIS sets, demonstrating a new division of labor where the AI proposes proof candidates and the verifier enforces correctness.

0 favorites 0 likes
#lean

Using algebra and LLMs to verify a flight-plan bug fix in Lean

Lobsters Hottest · 2026-05-19 Cached

A developer uses LLMs and algebraic reformulation to formally verify a bug fix for the 2023 UK air traffic control meltdown in the Lean proof assistant, finding that LLMs are great at grinding proofs but poor at specifications.

0 favorites 0 likes
#lean

@VitalikButerin: Many people have claimed that with AI-assisted bug finding, secure code (and hence trustless anything) will be impossib…

X AI KOLs Following · 2026-05-18 Cached

Vitalik Buterin shares an optimistic take on AI-assisted formal verification as a path to secure, trustless code, linking to his blog post explaining the basics of formal verification using Lean.

0 favorites 0 likes
#lean

I don't think AI will make your processes go faster

Hacker News Top · 2026-05-17 Cached

The author argues that AI will not necessarily accelerate processes because bottlenecks often originate from unclear requirements upstream, not from development speed alone.

0 favorites 0 likes
#lean

MathAtlas: A Benchmark for Autoformalization in the Wild

arXiv cs.AI · 2026-05-15 Cached

MathAtlas is a large-scale benchmark for autoformalization of graduate-level mathematics, containing ~52k theorems and definitions extracted from 103 textbooks, with a mathematical dependency graph of ~178k relations. Experiments show state-of-the-art models achieve at most 9.8% correctness, highlighting the difficulty.

0 favorites 0 likes
#lean

Signal Shot: a project to verify the Signal protocol and its Rust implementation using Lean

Lobsters Hottest · 2026-04-21 Cached

Signal Shot is a major formal verification initiative to verify the Signal protocol and its Rust implementation using Lean, combining advances in Rust-to-Lean translation (Aeneas), mathematical foundations (Mathlib/CSLib), automated tactics (grind/SymM), and AI-assisted formalization. This represents a significant test of whether Lean can scale from pure mathematics to deployed real-world software systems.

0 favorites 0 likes
← Previous
← Back to home

Submit Feedback