AF - Sycophancy to subterfuge: Investigating reward tampering in large language models by Evan Hubinger | The Nonlinear Library | Podwise