LessWrong (30+ Karma) - “Measuring the ability of Opus 4.5 to fool narrow classifiers” by Fabien Roger, John Hughes
Sign in to continue reading, translating and more.