AI Safety Fundamentals - Weak-To-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Sign in to continue reading, translating and more.