State of LLMs 2026: RLVR, GRPO, Inference Scaling — Sebastian Raschka | The MAD Podcast with Matt Turck | Podwise