AI Papers Reader

Personalized digests of latest AI research

View on GitHub

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

DeepSeek-Prover-V1.5 is an open-source language model (LM) developed by DeepSeek-AI for theorem proving in Lean 4. It significantly improves upon its predecessor, DeepSeek-Prover-V1, by incorporating several key advancements, including an enhanced training regime and an innovative Monte-Carlo tree search (MCTS) algorithm. These improvements result in a new state-of-the-art performance on both the miniF2F (high school level) and ProofNet (undergraduate level) benchmarks, achieving pass rates of 63.5% and 25.3%, respectively.

The new model tackles the challenges of formal theorem proving, where LMs must not only comprehend formal mathematical languages but also align their reasoning with rigorous specifications. Prior work on formal theorem provers using LMs typically focuses on two strategies:

DeepSeek-Prover-V1.5 takes a unified approach by combining the strengths of both strategies. Its key innovations include:

In addition to these key algorithmic innovations, DeepSeek-Prover-V1.5 also incorporates several enhancements in its training process:

DeepSeek-Prover-V1.5 is a significant advancement in formal theorem proving, pushing the boundaries of what is possible with LMs. Its unified approach to proof generation, combined with its innovative MCTS algorithm and enhanced training regime, makes it a powerful tool for tackling complex formal mathematics problems. It is expected to have a significant impact on future research in the field.