LessWrong (30+ Karma) - “What and Why: Developmental Interpretability of Reinforcement Learning” by Garrett Baker
Sign in to continue reading, translating and more.