
`lf-lean`: The frontier of verified software engineering

February 5, 2026

We present lf-lean, a verified translation of all 1,276 statements of the Logical Foundations textbook from Rocq to Lean. Frontier AI produced it with ~2 person-days of human effort, versus an estimated ~2.75 person-years for a manual translation (a 350x speed-up). The key is task-level specification generators: because many software transformations are semantics-preserving, correctness can be defined once for an entire task class and then checked automatically across all instances and codebases. This reduces the human oversight required from 𝒪(𝓃) to 𝒪(1), regardless of program complexity. Placed on METR’s time horizon graph, our result suggests that verified software engineering is advancing faster than expected.
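To make the "define correctness once per task class" idea concrete, here is a minimal sketch in Python. It is an illustration only, not the lf-lean pipeline: the names `make_equivalence_spec`, `sum_to_n_loop`, `sum_to_n_closed`, and `sum_to_n_buggy` are hypothetical, and the check compares outputs on a finite set of sample inputs, which is far weaker than the formal verification the post describes.

```python
from typing import Callable, Iterable

Program = Callable[[int], int]
Spec = Callable[[Program], bool]


def make_equivalence_spec(original: Program, inputs: Iterable[int]) -> Spec:
    """Hypothetical task-level specification generator.

    Written once for the whole task class "semantics-preserving rewrite":
    given any original program, it returns a checker that accepts a
    candidate only if it agrees with the original on every sample input.
    """
    samples = list(inputs)

    def spec(candidate: Program) -> bool:
        return all(candidate(x) == original(x) for x in samples)

    return spec


# One instance of the task class: an original program and two rewrites.
def sum_to_n_loop(n: int) -> int:
    total = 0
    for i in range(n + 1):
        total += i
    return total


def sum_to_n_closed(n: int) -> int:
    return n * (n + 1) // 2  # faithful rewrite


def sum_to_n_buggy(n: int) -> int:
    return n * (n - 1) // 2  # off-by-one rewrite


spec = make_equivalence_spec(sum_to_n_loop, range(50))
print(spec(sum_to_n_closed), spec(sum_to_n_buggy))
```

The human writes `make_equivalence_spec` once; each new program to be rewritten gets its checker for free, which is the sense in which oversight stays constant as the number of instances grows.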


