Current location: Home> AI Model> Reinforcement Learning
Goedel-Prover

Goedel-Prover

Goedel-Prover is an open source LLM launched by Princeton, Tsinghua and other institutions. It can transform mathematical problems into formal proofs and significantly improve the proof ability of automation theorems.
Author:LoRA
Inclusion Time:28 Mar 2025
Downloads:5311
Pricing Model:Free
Introduction

What is Goedel-Prover ?

Goedel-Prover is an open source AI model that focuses on the form proof generation of automated mathematical problems. The core objectives of this model are:
✅ Convert natural language math problems to formal language (such as Lean 4)
✅ Automatically generate complete mathematical proofs to solve the problem of scarcity of formal mathematical statements and proofs

✅ Improve the automatic proof ability of mathematical theorems and promote the development of AI in the field of mathematical reasoning

Goedel-Prover.jpg

Goedel-Prover has achieved breakthrough results in several benchmarks, such as:
1.miniF2F benchmark test: success rate is 57.6%, surpassing all previous open source models
2.PutnamBench: Successfully solved 7 difficult math problems
3.Lean Workbook: Automatically generate nearly 30,000 formal proofs


Core functions

✅ 1. Formal translation

  • Automatically analyze natural language mathematical problems and accurately convert them to Lean 4 formal languages

  • Ensure the logical integrity and mathematical rigor of translation

✅ 2. Automatic theorem proof

  • Generate a complete mathematical proof process based on AI automatic reasoning

  • Suitable for advanced mathematics, computer science and other fields

✅ 3. Expert Iteration Training

  • Using expert iterative methods to perform multiple rounds of optimization to continuously improve mathematical proof ability

  • Verify the correctness of the proof using the Lean compiler to ensure that the generated proof is rigorous

✅ 4. Large-scale dataset training

  • Combining Numina, Lean Workbook, Mathlib4 and other data sets to enhance generalization capabilities

  • Continuously expand the mathematical problem bank during the training process to adapt to different mathematical fields

Technical Principles

1. Formal translation

  • Formalizer A & B: Two different styles of mathematical translation to improve diversity

  • Compile Correctness (CC) Test: Ensure formal statements comply with Lean syntax

  • Fidelity and Completeness (FC) Test: Ensure that translations accurately express original mathematical problems

2. Expert iterative training

  • Use DeepSeek-Prover-V1.5-RL to generate initial proof

  • Verify correctness through Lean compiler and filter high-quality proofs

  • Training data is continuously updated, model is continuously optimized, and automatic proof ability is improved

3. Dataset extension

  • Combining external datasets such as Numina and Mathlib4, enriching the mathematical theorem library

  • Gradually increase Lean Workbook data during training to improve adaptability in different math fields


Application scenarios

Mathematical research: Help mathematicians to verify complex theorems and accelerate research progress Mathematical teaching: Provide teachers with detailed mathematical proofs to assist students in understanding mathematical logic software verification: used to verify the correctness of software algorithms, improve safety and reliability
AI algorithm verification: Ensure the mathematical theoretical foundation of AI-related algorithms is reasonable and rigorous interdisciplinary research: Supporting the application of mathematical reasoning in physics, engineering and other disciplines

Goedel-Prover Project Address

GitHub repository: Goedel-Prover
HuggingFace Model Library: Model Download
arXiv Paper: Technical Report

Guess you like
  • Goedel-Prover

    Goedel-Prover

    Goedel-Prover is an open source LLM launched by Princeton, Tsinghua and other institutions. It can transform mathematical problems into formal proofs and significantly improve the proof ability of automation theorems.
    Automated mathematical proof AI theorem proof
  • Neo-1

    Neo-1

    Discover how Neo-1, VantAI's groundbreaking AI model, revolutionizes molecular design and drug development with precise structure predictions and innovative features.
    Neo-1 AI model molecular design AI
  • Step Reasoner mini

    Step Reasoner mini

    Are you still worried about complex mathematical logic problems and boring text creation? Step R-mini helps you solve it easily!
    AI reasoning model a reasoning model with both liberal arts and science capabilities
  • Microsoft Phi-4

    Microsoft Phi-4

    Microsoft Phi-4 is an artificial intelligence (AI) framework developed by Microsoft for automated training and inference of deep learning and reinforcement learning tasks.
    Small language models mathematics
Selected columns
  • Cursor ai Tutorial

    Cursor ai Tutorial

    Cursor is a powerful AI programming editor that integrates intelligent completion, code interpretation and debugging functions. This article explains the core functions and usage methods of Cursor in detail.
  • ComfyUI Tutorial

    ComfyUI Tutorial

    ComfyUI is an efficient UI development framework. This tutorial details the features, components and practical tips of ComfyUI.
  • Grok Tutorial

    Grok Tutorial

    Grok is an AI programming assistant. This article introduces the functions, usage methods and practical skills of Grok to help you improve programming efficiency.
  • Gemini Tutorial

    Gemini Tutorial

    Gemini is a multimodal AI model launched by Google. This guide analyzes Gemini's functions, application scenarios and usage methods in detail.
  • Second Me Tutorial

    Second Me Tutorial

    Welcome to the Second Me Creation Experience Page! This tutorial will help you quickly create and optimize your second digital identity.