Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models
Kimina-Prover applies test-time reinforcement learning search to strengthen the formal reasoning capabilities of large language models. The work targets higher accuracy on complex mathematical and logical proofs.




