MatX optimizes LLM performance by developing specialized chips that prioritize matrix multiplication and pair it with a hybrid SRAM-HBM memory architecture. Unlike traditional GPU-centric designs, this approach addresses the prohibitive costs of large-scale inference by eliminating weight-loading bottlenecks and enabling significantly lower latency. As frontier labs shift toward multi-platform strategies to manage massive compute expenditures, the economic incentive to adopt custom silicon has grown. By co-designing hardware with advanced attention research and numerics, MatX offers model labs a path to greater efficiency and headroom. While the company faces the substantial challenge of scaling manufacturing to meet gigawatt-scale data center requirements, its focus on workload-specific hardware, including custom network topologies for mixture-of-experts models, positions it to compete directly with established incumbents in the evolving AI infrastructure landscape.
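For intuition on the weight-loading bottleneck, a minimal back-of-envelope sketch: when decoding is memory-bandwidth bound, every generated token requires streaming the full set of model weights from memory, so per-token latency is bounded below by weight bytes divided by effective memory bandwidth. The figures below (a hypothetical 70B-parameter dense model at 16-bit precision, illustrative HBM and hybrid SRAM-HBM bandwidths) are assumptions for illustration, not MatX specifications.

```python
# Back-of-envelope: per-token decode latency when inference is
# memory-bandwidth bound (each token streams all weights once).
# All numbers are illustrative assumptions, not MatX specs.

def per_token_latency_ms(n_params: float, bytes_per_param: float,
                         mem_bw_gb_s: float) -> float:
    """Lower-bound decode latency in ms: weight bytes / memory bandwidth."""
    weight_bytes = n_params * bytes_per_param
    return weight_bytes / (mem_bw_gb_s * 1e9) * 1e3

N_PARAMS = 70e9       # hypothetical 70B-parameter dense model
BYTES_PER_PARAM = 2   # 16-bit weights

# Assumed bandwidths: ~3,300 GB/s for an HBM-only accelerator,
# 20,000 GB/s as a placeholder for a hybrid SRAM-HBM design.
hbm_only = per_token_latency_ms(N_PARAMS, BYTES_PER_PARAM, 3_300)
hybrid   = per_token_latency_ms(N_PARAMS, BYTES_PER_PARAM, 20_000)

print(f"HBM-only lower bound:  {hbm_only:.1f} ms/token")  # ~42.4 ms
print(f"Hybrid SRAM-HBM bound: {hybrid:.1f} ms/token")    # ~7.0 ms
```

Under these assumptions the bandwidth term alone sets a hard latency floor, which is why raising effective bandwidth with on-chip SRAM translates directly into lower per-token latency.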