
In this episode of the Latent Space Podcast, Alessio interviews Quentin Anthony, Head of Model Training at Zyphra and advisor at Eleuther AI, about Zyphra's work on foundation models for edge deployment and their recent move to AMD training clusters. Quentin shares insights on optimizing kernels for AMD GPUs, the role of open source in AMD development, and the use of coding agents. They discuss the METR study on AI's impact on software engineering productivity, Quentin's coding workflow with AI, and the challenges of evaluating AI-generated kernels. The conversation also covers Zyphra's model development roadmap, the potential of ASICs for inference, edge deployment strategies, and the future of open source AI with Eleuther AI.
Sign in to continue reading, translating and more.
Continue