
The Semi Doped podcast features Austin Lyons and Vik Shaker dissecting NVIDIA's GTC keynote, focusing on the shift towards agentic AI and its implications for compute needs. They explore Jensen Huang's framing of AI's evolution, from training to inference and now agentic AI, requiring exponentially more tokens. The discussion highlights the introduction of tiers for AI inferencing, from free, basic models to ultra-premium, low-latency options like Groq. The speakers analyze the Vera Rubin system, encompassing GPU and CPU racks, and the integration of Groq for accelerated decoding. They also touch on the debate between copper and optical interconnects for scaling AI infrastructure, and the potential of NVIDIA's DSX platform for digital twin simulations of data centers.
Sign in to continue reading, translating and more.
Continue