Front-Loading Reasoning: The Synergy between Pretraining and Post-Training Data | Best AI papers explained | Podwise