“How the NanoGPT Speedrun WR dropped by 20% in 3 months” by larry-dial | LessWrong (30+ Karma) | Podwise