LessWrong (30+ Karma) - “Selective Generalization: Improving Capabilities While Maintaining Alignment” by ariana_azarbal, Matthew A. Clarke, jorio, Cailley Factor, cloud