FATE-M
The FATE-M (Formal Algebra Theorem Evaluation - Medium) benchmark.
Overview
This benchmark contains 150 easy abstract algebra exercises collected from textbooks and formalized in Lean.
Benchmark Structure
Each Lean file consists of a single exercise, with:
- One fully formalized statement,
- A single
sorry
, - Appropriate open namespaces at the beginning,
- Natural language annotations preceding the statement.
No new definitions are formalized in any of the files. For a complete list of exercises in both natural language and Lean code, please refer to the file FATE-M.pdf.
For users' convenience, we provide a JSON file (FATE-M.json) where each entry represents a problem and includes the following information:
- Problem ID,
- Formal statement (Lean code),
- Natural language statement,
- Benchmark source (FATE-M).
Lean Toolchain
The initial version of this benchmark uses leanprover/lean4:v4.16.0
. This may change in future updates.
Additional Information
This is a refactored version of the benchmark referenced in the paper REAL-Prover: Retrieval-Augmented Lean Prover for Mathematical Reasoning.