“Goals selected from learned knowledge: an alternative to RL alignment” by Seth Herd | LessWrong (30+ Karma) | Podwise