Inside Google’s massive AI capex (live)

AI infrastructure growth is currently driven by massive capital expenditure, with companies like Google allocating over $175 billion annually to support expanding compute needs. As the industry transitions from model training to inference, the requirement for gigawatt-scale data centers is evolving toward more flexible, geographically distributed deployments. Reliability standards, traditionally set at four nines, are being re-evaluated as compute costs dominate total service expenses, prompting a shift toward lower-reliability power delivery in exchange for increased capacity. Amin Vahdat, Chief Technologist for AI Infrastructure at Google, emphasizes that vertical integration—co-designing chips, software, and power systems—is essential for efficiency. While labor, power, and chip supply chains remain critical rate limiters, the future of data center design relies on optimizing power-to-space ratios and managing microgrid-like control systems to handle diverse, latency-sensitive workloads.

Outlines

Sign in to continue reading, translating and more.

Open full episode in Podwise

Catalyst with Shayle Kann

Scaling Data Center Infrastructure and Capital Expenditure

Inference Demand and Reliability Trade-offs

Behind-the-Meter Power and Grid Integration

Vertical Integration and Growth Rate Limiters

Physical AI and Long-term Infrastructure Efficiency

Inside Google’s massive AI capex (live)

Catalyst with Shayle Kann

01:32Scaling Data Center Infrastructure and Capital Expenditure

Scaling Data Center Infrastructure and Capital Expenditure

07:20Inference Demand and Reliability Trade-offs

Inference Demand and Reliability Trade-offs

14:10Behind-the-Meter Power and Grid Integration

Behind-the-Meter Power and Grid Integration

21:14Vertical Integration and Growth Rate Limiters

Vertical Integration and Growth Rate Limiters

27:32Physical AI and Long-term Infrastructure Efficiency

Physical AI and Long-term Infrastructure Efficiency