[Linkpost] “Incorrect Baseline Evaluations Call into Question Recent LLM-RL Claims” by shash42 | LessWrong (30+ Karma) | Podwise