Random Samples: On scalable RL in the era of agentic LLMs | Red Hat | Podwise