“AI companies’ eval reports mostly don’t support their claims” by Zach Stein-Perlman | LessWrong (30+ Karma) | Podwise