In this episode of No Priors, the DeepMind AlphaProof team shares insights about their groundbreaking AI system designed to tackle complex mathematical problems. Notably, AlphaProof has successfully solved four out of six challenges from the prestigious International Mathematical Olympiad (IMO). By building on AlphaZero's reinforcement learning framework, the team has adapted it to search for mathematical proofs in a formal language. They discuss the significant challenges posed by the vast landscape of mathematics and introduce "test-time RL" as an innovative solution. While AlphaProof shows promise, the team acknowledges its limitations, such as the inability to create new mathematical theories. They also explore exciting future applications, including code verification and advancements in artificial general intelligence (AGI), highlighting the essential partnership between human intuition and AI in the realm of mathematical discovery.