“Twitter thread on AI safety evals” by Richard_Ngo | LessWrong (30+ Karma) | Podwise