DeepSeek-Prover-V2 is an open-source large language model designed for formal theorem proving in Lean 4, with a sophisticated data collection and reinforcement learning process for enhancing formal mathematical reasoning.
Utilizes DeepSeek-V3 to decompose complex mathematical problems into subgoals and formalizes these steps for improved proof generation.
Implement a cold-start training using synthesized data to better bridge informal reasoning with formal proof construction.
Offers models in two sizes: 7B and 671B parameters for various computational needs.
Includes a benchmark dataset with 325 formalized mathematical problems for comprehensive evaluation.
Easily deploy and utilize the model using Hugging Face's Transformers library for inference.