The Semi Doped podcast features Austin Lyons and Vik Shaker dissecting NVIDIA's GTC keynote, focusing on the shift toward agentic AI and its implications for compute demand. They explore Jensen Huang's framing of AI's evolution from training to inference and now to agentic AI, which requires exponentially more tokens. The discussion highlights the introduction of tiers for AI inference, ranging from free, basic models to ultra-premium, low-latency options like Groq. The speakers analyze the Vera Rubin system, which encompasses GPU and CPU racks, and the integration of Groq for accelerated decoding. They also touch on the debate between copper and optical interconnects for scaling AI infrastructure, and on the potential of NVIDIA's DSX platform for digital twin simulations of data centers.
Part 1: Introduction, Keynote Overview
Part 2: AI Inference, Agentic Era, and Groq
Part 3: Hardware Architecture, Interconnects, and Systems
Part 4: Software, Security, and Enterprise Strategy