[Linkpost] “METR: Measuring AI Ability to Complete Long Tasks” by Zach Stein-Perlman | LessWrong (30+ Karma) | Podwise