Measuring AI Ability to Complete Long Tasks | AI Safety Fundamentals | Podwise