“Scalable Oversight and Weak-to-Strong Generalization: Compatible approaches to the same problem” by Ansh Radhakrishnan, Buck, ryan_greenblatt, Fabien Roger
LessWrong (30+ Karma) - “Scalable Oversight and Weak-to-Strong Generalization: Compatible approaches to the same problem” by Ansh Radhakrishnan, Buck, ryan_greenblatt, Fabien Roger