How Zyphra went all-in on AMD + Why Devs feel faster with AI but are slower — with Quentin Anthony
Latent Space: The AI Engineer Podcast
In this episode of the Latent Space Podcast, Alessio interviews Quentin Anthony, Head of Model Training at Zyphra and advisor at Eleuther AI, about Zyphra's work on foundation models for edge deployment and their recent move to AMD training clusters. Quentin shares insights on optimizing kernels for AMD GPUs, the role of open source in AMD development, and the use of coding agents. They discuss the METR study on AI's impact on software engineering productivity, Quentin's coding workflow with AI, and the challenges of evaluating AI-generated kernels. The conversation also covers Zyphra's model development roadmap, the potential of ASICs for inference, edge deployment strategies, and the future of open source AI with Eleuther AI.
Part 1: Zyphra, AMD, and Open Source
Part 2: Kernel Development and Hardware Considerations
Part 3: Edge Deployment and Future Roadmap
Part 4: AI Coding Productivity and Workflow
Part 5: Team Building, Interviewing, and Open Source
Sign in to continue reading, translating and more.
Open full episode in Podwise