Sycophancy to subterfuge: Investigating reward-tampering in large language models | Best AI papers explained | Podwise