24 Feb 2025
1h 12m

20VC: Why Google Will Win the AI Arms Race & OpenAI Will Not | NVIDIA vs AMD: Who Wins and Why | The Future of Inference vs Training | The Economics of Compute & Why To Win You Must Have Product, Data & Compute with Steeve Morin @ ZML

Podcast cover

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch

This 20VC podcast episode interviews Steeve Morin, founder of ZML, about the future of AI chips and inference. The conversation covers the inefficiencies of current GPU-centric approaches, particularly for inference, highlighting the cost savings possible with alternative hardware like AMD GPUs. Morin predicts a significant shift towards inference (95% of the market in five years) driven by the rise of agents and latency-bound reasoning, making current GPU strategies unsustainable. He advocates for software solutions that abstract away hardware dependencies, enabling seamless switching between providers and unlocking cost efficiencies. A key takeaway is the substantial cost difference between using NVIDIA and AMD GPUs for inference, with AMD offering up to a 4x efficiency gain in some cases.

Outlines

Part 1: Introduction and Market Overview

Part 2: Inference Infrastructure and Alternatives

Part 3: Model Scaling and AI Competition

Part 4: Conclusion

Sign in to continue reading, translating and more.

Open full episode in Podwise