This 20VC podcast episode interviews Steeve Morin, founder of ZML, about the future of AI chips and inference. The conversation examines the inefficiencies of current GPU-centric approaches to inference and the cost savings available from alternative hardware such as AMD GPUs; Morin cites up to a 4x efficiency gain over NVIDIA in some inference workloads. He predicts a significant shift toward inference, reaching 95% of the AI compute market within five years, driven by the rise of agents and latency-bound reasoning, and argues this makes current GPU strategies unsustainable. His proposed answer is software that abstracts away hardware dependencies, enabling seamless switching between providers and unlocking those cost efficiencies.