Goedel-Prover

Automated mathematical proof AI theorem proof formal mathematical model

Goedel-Prover is an open source LLM launched by Princeton, Tsinghua and other institutions. It can transform mathematical problems into formal proofs and significantly improve the proof ability of automation theorems.

No Resources Yet

Author:LoRA

Inclusion Time:28 Mar 2025

Downloads:5311

Pricing Model:Free

Introduction

What is Goedel-Prover ?

Goedel-Prover is an open source AI model that focuses on the form proof generation of automated mathematical problems. The core objectives of this model are:
✅ Convert natural language math problems to formal language (such as Lean 4)
✅ Automatically generate complete mathematical proofs to solve the problem of scarcity of formal mathematical statements and proofs

✅ Improve the automatic proof ability of mathematical theorems and promote the development of AI in the field of mathematical reasoning

Goedel-Prover has achieved breakthrough results in several benchmarks, such as:
1.miniF2F benchmark test: success rate is 57.6%, surpassing all previous open source models
2.PutnamBench: Successfully solved 7 difficult math problems
3.Lean Workbook: Automatically generate nearly 30,000 formal proofs

Core functions

✅ 1. Formal translation

Automatically analyze natural language mathematical problems and accurately convert them to Lean 4 formal languages
Ensure the logical integrity and mathematical rigor of translation

✅ 2. Automatic theorem proof

Generate a complete mathematical proof process based on AI automatic reasoning
Suitable for advanced mathematics, computer science and other fields

✅ 3. Expert Iteration Training

Using expert iterative methods to perform multiple rounds of optimization to continuously improve mathematical proof ability
Verify the correctness of the proof using the Lean compiler to ensure that the generated proof is rigorous

✅ 4. Large-scale dataset training

Combining Numina, Lean Workbook, Mathlib4 and other data sets to enhance generalization capabilities
Continuously expand the mathematical problem bank during the training process to adapt to different mathematical fields

Technical Principles

✅ 1. Formal translation

Formalizer A & B: Two different styles of mathematical translation to improve diversity
Compile Correctness (CC) Test: Ensure formal statements comply with Lean syntax
Fidelity and Completeness (FC) Test: Ensure that translations accurately express original mathematical problems

✅ 2. Expert iterative training

Use DeepSeek-Prover-V1.5-RL to generate initial proof
Verify correctness through Lean compiler and filter high-quality proofs
Training data is continuously updated, model is continuously optimized, and automatic proof ability is improved

✅ 3. Dataset extension

Combining external datasets such as Numina and Mathlib4, enriching the mathematical theorem library
Gradually increase Lean Workbook data during training to improve adaptability in different math fields

Application scenarios

Mathematical research: Help mathematicians to verify complex theorems and accelerate research progress Mathematical teaching: Provide teachers with detailed mathematical proofs to assist students in understanding mathematical logic software verification: used to verify the correctness of software algorithms, improve safety and reliability
AI algorithm verification: Ensure the mathematical theoretical foundation of AI-related algorithms is reasonable and rigorous interdisciplinary research: Supporting the application of mathematical reasoning in physics, engineering and other disciplines

Goedel-Prover Project Address

GitHub repository: Goedel-Prover
HuggingFace Model Library: Model Download
arXiv Paper: Technical Report

Guess you like

Goedel-Prover

Goedel-Prover is an open source LLM launched by Princeton, Tsinghua and other institutions. It can transform mathematical problems into formal proofs and significantly improve the proof ability of automation theorems.

Automated mathematical proof AI theorem proof
Neo-1

Discover how Neo-1, VantAI's groundbreaking AI model, revolutionizes molecular design and drug development with precise structure predictions and innovative features.

Neo-1 AI model molecular design AI
Step Reasoner mini

Are you still worried about complex mathematical logic problems and boring text creation? Step R-mini helps you solve it easily!

AI reasoning model a reasoning model with both liberal arts and science capabilities
Microsoft Phi-4

Microsoft Phi-4 is an artificial intelligence (AI) framework developed by Microsoft for automated training and inference of deep learning and reinforcement learning tasks.

Small language models mathematics

Selected columns

Cursor ai Tutorial

Cursor is a powerful AI programming editor that integrates intelligent completion, code interpretation and debugging functions. This article explains the core functions and usage methods of Cursor in detail.
ComfyUI Tutorial

ComfyUI is an efficient UI development framework. This tutorial details the features, components and practical tips of ComfyUI.
Grok Tutorial

Grok is an AI programming assistant. This article introduces the functions, usage methods and practical skills of Grok to help you improve programming efficiency.
Gemini Tutorial

Gemini is a multimodal AI model launched by Google. This guide analyzes Gemini's functions, application scenarios and usage methods in detail.
Second Me Tutorial

Welcome to the Second Me Creation Experience Page! This tutorial will help you quickly create and optimize your second digital identity.