RFT Launch, How OpenAI Improves Its Models & the State of AI Agents Today
Unsupervised Learning: Redpoint's AI Podcast
In this episode of Unsupervised Learning, Jacob Effron interviews Michelle Pokrass, a key figure behind GPT-4.1 at OpenAI. They discuss the model's focus on real-world utility and developer experience, moving beyond benchmark optimization by gathering feedback directly from users and startups to identify areas for improvement. Michelle shares insights on creating effective evals, the challenges of long-context evaluations, and the importance of instruction following. The conversation explores the current state of AI agents, the advancements in code generation, and the potential for fine-tuning models for specific tasks. Michelle also touches on the future of OpenAI's models, emphasizing a move towards more generalized AI solutions while also discussing the balance between model capabilities and practical implementation for enterprises.
Part 1: Introduction and Development
Part 2: AI Agents and Model Capabilities
Part 3: Future Outlook and Best Practices
Sign in to continue reading, translating and more.
Open full episode in Podwise
