Measuring Progress on Scalable Oversight for Large Language Models | AI Safety Fundamentals | Podwise