FATE-M

The FATE-M (Formal Algebra Theorem Evaluation - Medium) benchmark.

Overview

This benchmark contains 150 easy abstract algebra exercises collected from textbooks and formalized in Lean.

Benchmark Structure

Each Lean file consists of a single exercise, with:

One fully formalized statement,
A single sorry,
Appropriate open namespaces at the beginning,
Natural language annotations preceding the statement.

No new definitions are formalized in any of the files. For a complete list of exercises in both natural language and Lean code, please refer to the file FATE-M.pdf.

For users' convenience, we provide a JSON file (FATE-M.json) where each entry represents a problem and includes the following information:

Problem ID,
Formal statement (Lean code),
Natural language statement,
Benchmark source (FATE-M).

The initial version of this benchmark uses leanprover/lean4:v4.16.0. For each supported minor version of Lean, a corresponding and updated iteration of the benchmark is available via the repository's tags.

Additional Information

This is a refactored version of the benchmark referenced in the paper REAL-Prover: Retrieval-Augmented Lean Prover for Mathematical Reasoning.

FATE-M

FATE-M

Overview

Benchmark Structure

Lean Toolchain

Additional Information