Summary of Deepseek-prover-v1.5: Harnessing Proof Assistant Feedback For Reinforcement Learning and Monte-carlo Tree Search, by Huajian Xin et al.
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Searchby Huajian Xin, Z.Z.…