20VC: Why Google Will Win the AI Arms Race & OpenAI Will Not | NVIDIA vs AMD: Who Wins and Why | The Future of Inference vs Training | The Economics of Compute & Why To Win You Must Have Product, Data & Compute with Steeve Morin @ ZML | The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch | Podwise
20VC: Why Google Will Win the AI Arms Race & OpenAI Will Not | NVIDIA vs AMD: Who Wins and Why | The Future of Inference vs Training | The Economics of Compute & Why To Win You Must Have Product, Data & Compute with Steeve Morin @ ZML
This 20VC podcast episode interviews Steeve Morin, founder of ZML, about the future of AI chips and inference. The conversation covers the inefficiencies of current GPU-centric approaches, particularly for inference, highlighting the cost savings possible with alternative hardware like AMD GPUs. Morin predicts a significant shift towards inference (95% of the market in five years) driven by the rise of agents and latency-bound reasoning, making current GPU strategies unsustainable. He advocates for software solutions that abstract away hardware dependencies, enabling seamless switching between providers and unlocking cost efficiencies. A key takeaway is the substantial cost difference between using NVIDIA and AMD GPUs for inference, with AMD offering up to a 4x efficiency gain in some cases.