The Latent Space podcast features Nader Khalil and Kyle Kranen from NVIDIA, discussing developer experience, GPU technology, and the company's internal culture. They recount Brev's acquisition by NVIDIA, emphasizing the shared goal of simplifying developer access to GPUs, and touch on NVIDIA's developer experience initiatives. The conversation explores Dynamo, a data center scale inference engine, detailing its role in optimizing inference at scale through techniques like disaggregation, prefill, and decode. They also discuss "SOL" (Speed of Light) as a concept for creating urgency and understanding theoretical limits, and explore the balance between stability and innovation. The podcast further examines the evolution of AI, the importance of hardware-model co-design, and the potential of agents in coding and business applications.
Sign in to continue reading, translating and more.
Continue