16 Aug 2023
50m

The Mathematics of Training LLMs — with Quentin Anthony of Eleuther AI

Podcast cover

Latent Space: The AI Engineer Podcast

This podcast episode delves into various aspects of scaling up large language models and training transformer-based models, emphasizing the practical considerations, challenges, and limitations involved. It covers topics such as hardware setup, flops, quantization, distributed training techniques, and emerging research directions in deep learning.

Outlines

Sign in to continue reading, translating and more.

Open full episode in Podwise