The End of SWE-Bench Verified — Mia Glaese & Olivia Watkins, OpenAI Frontier Evals | Latent Space | Podwise