13 Mar 2026

2h 31m

Dylan Patel — Deep Dive on the 3 Big Bottlenecks to Scaling AI Compute

Dwarkesh Podcast

The conversation centers on the future of AI compute, particularly the bottlenecks to scaling AI capabilities. Dylan Patel, CEO of SemiAnalysis, offers insights into the semiconductor supply chain, power demands, and the capital expenditures of major tech companies. He argues that while power and data centers were the earlier constraints, the bottleneck has shifted to chip manufacturing, especially EUV lithography tools. Patel highlights Nvidia's strategic positioning and the challenges facing competitors such as Google, as well as China. The discussion touches on the trade-offs between model size and compute efficiency and on the economic implications of AI infrastructure investment, and suggests that the US and China could diverge depending on how fast AI development proceeds.

Outlines

Part 1: Capex, Funding, and Compute Timelines

Part 2: GPU Economics and Value Models

Part 3: Silicon Strategy and Foundry Dynamics

Part 4: ASML and Lithography Bottlenecks

Part 5: Hardware Architecture and Performance

Part 6: Geopolitics and the China-US Race

Part 7: The Memory Crunch and Consumer Impact

Part 8: Infrastructure, Power, and Scaling

Part 9: Future Frontiers and Robotics

Part 1: Capex, Funding, and Compute Timelines

00:00 Big Tech's $600 Billion Capex Forecast and AI Labs' Funding: A Compute Timeline

01:41 Hyperscaler Capex Spending: Turbine Deposits, Data Center Construction, and Power Agreements

03:48 Anthropic's Compute Constraints: Lower Quality Providers and Financial Freakouts

06:17 Acquiring Compute in a Pinch: Neoclouds, Shorter-Term Deals, and Higher Prices

07:38 Neocloud Capacity and Revenue Sharing: Anthropic's 50% Markup

09:32 Locking in Compute: The Advantage of Early Commitment and GPU Depreciation Cycles

Part 2: GPU Economics and Value Models

11:21 GPU Depreciation and Gross Margins: A TCO Model Perspective

13:03 GPU Utility and Value: GPT 5.4 and the Increasing Worth of H100s

15:51 The Value of H100s with AGI Models and Dario's Conservative Compute Approach

18:52 The Rising Value of GPUs and the Alchian-Allen Effect on Model Margins

21:18 Long-Term Compute Contracts and Margin Accrual in the AI Supply Chain

Part 3: Silicon Strategy and Foundry Dynamics

24:28 Nvidia's Strategy: Fracturing Complementary Industries and TSMC's Allocations

26:07 TSMC's Calculus: Market Signals and Nvidia's AGI-Pilled Approach

29:30 Google's TPU Bottleneck and Anthropic's Compute Acquisition

31:23 Google's Gemini ARR and AGI Awakening: Turbine Deposits and Power Agreements

34:26 Compute as the Biggest Bottleneck: The Semiconductor Supply Chain and Fab Construction

35:59 Shifting Capacity: From Mobile and PC to AI Chips and a Gigawatt Ceiling

Part 4: ASML and Lithography Bottlenecks

37:03 ASML as the Ultimate Bottleneck: EUV Tools and AI Compute Limits

41:07 Carl Zeiss and TSMC's CapEx: Bottlenecks and Nvidia's Earnings

42:35 Sam Altman's Gigawatt Goal: EUV Tools and AI Chip Allocation

44:01 ASML's Generosity: EUV Tool Improvements and Pricing

46:10 ASML's Supply Chain: Complex Components and AGI-Pilledness

55:00 Returning to 7nm: Multi-Patterning and Process Improvements

Part 5: Hardware Architecture and Performance

58:12 Unfair Comparisons: Numerics and Design Targets in GPU Performance

1:00:21 Model Performance and Chip Communication: The Impact of Process Node Shrinking

1:02:30 Hopper vs. Blackwell: Performance Differences and Packaging Limitations

Part 6: Geopolitics and the China-US Race

1:05:00 China's Semiconductor Ambitions: Scale vs. Process Technology

1:06:40 China's Semiconductor Supply Chain: EUV Tools and Production Challenges

1:08:26 Chinese Production Capacity: DUV Tools and AGI Timelines

1:10:55 China's Model Capabilities and the Compute Race: A Diverging Path

1:12:21 US Economic Growth and the Return on Invested Capital in Data Centers

1:14:18 Fast vs. Long Timelines: US Wins vs. China Wins in AI

Part 7: The Memory Crunch and Consumer Impact

1:16:10 Memory Crunch: HBM vs. Commodity DRAM and Agentic Tasks

1:17:35 The Highest Value Tasks: Time Sensitivity and the Demand for Speed

1:18:39 Chip IO and Bandwidth: The Constraints of DDR vs. HBM

1:20:21 Memory Capacity and System Design: The Four Constraints of GPU Performance