A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce | Xiaol.x | Podwise