Reasoning with Sampling: Base Models Outperform RL | Best AI papers explained | Podwise