LessWrong (30+ Karma) - “How To Use Model Internals In Training Is A Reasonable Line of Research” by Neel Nanda
Sign in to continue reading, translating and more.