YouTube, 08 Apr 2025

Apache Iceberg: What It Is and Why Everyone’s Talking About It.


Confluent Developer

In this solo podcast episode, Tim Berglund discusses Apache Iceberg, an open table format, and the evolution from data warehouses to data lakes that led to it. He explains why open table formats are needed, focusing on how Iceberg addresses consistency, transactionality, and schema management challenges. Berglund walks through Iceberg's logical architecture (data files, manifest files, manifest lists, metadata files, and catalogs), emphasizing its pluggable nature and its use in modern streaming environments. He also introduces Confluent's Tableflow, which integrates Iceberg semantics with Kafka topics so that topic data is accessible in real time as Iceberg tables.
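The layering named in the summary can be sketched as a toy in-memory model. This is only an illustration of how the pieces point at one another, not Iceberg's actual API or file layout: every class, function, path, and table name below (`DataFile`, `Manifest`, `ManifestList`, `Snapshot`, `TableMetadata`, `Catalog`, `plan_scan`, `db.orders`) is hypothetical, and real Iceberg stores these layers as files in object storage rather than as Python objects.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class DataFile:
    """Bottom layer: an immutable file holding rows (e.g. Parquet)."""
    path: str
    record_count: int


@dataclass
class Manifest:
    """Manifest file: lists a group of data files."""
    data_files: List[DataFile]


@dataclass
class ManifestList:
    """Manifest list: lists the manifests that make up one snapshot."""
    manifests: List[Manifest]


@dataclass
class Snapshot:
    """One consistent version of the table."""
    snapshot_id: int
    manifest_list: ManifestList


@dataclass
class TableMetadata:
    """Metadata file: schema plus the snapshot history (hypothetical shape)."""
    schema: Dict[str, str]
    snapshots: List[Snapshot] = field(default_factory=list)
    current_snapshot_id: Optional[int] = None

    def snapshot(self, snapshot_id: Optional[int] = None) -> Snapshot:
        wanted = self.current_snapshot_id if snapshot_id is None else snapshot_id
        for snap in self.snapshots:
            if snap.snapshot_id == wanted:
                return snap
        raise KeyError(f"no snapshot {wanted}")


class Catalog:
    """Top layer: maps a table name to its current metadata."""

    def __init__(self) -> None:
        self._tables: Dict[str, TableMetadata] = {}

    def commit(self, name: str, metadata: TableMetadata) -> None:
        # Assumption: a commit is modeled as swapping a single pointer.
        self._tables[name] = metadata

    def load(self, name: str) -> TableMetadata:
        return self._tables[name]


def plan_scan(catalog: Catalog, name: str,
              snapshot_id: Optional[int] = None) -> List[str]:
    """Walk catalog -> metadata -> manifest list -> manifests -> data files."""
    snap = catalog.load(name).snapshot(snapshot_id)
    return [f.path for m in snap.manifest_list.manifests for f in m.data_files]


# Two snapshots: the second reuses the first manifest and adds a new one.
m1 = Manifest([DataFile("data/a.parquet", 100)])
m2 = Manifest([DataFile("data/b.parquet", 50)])
meta = TableMetadata(
    schema={"id": "long", "amount": "double"},
    snapshots=[Snapshot(1, ManifestList([m1])), Snapshot(2, ManifestList([m1, m2]))],
    current_snapshot_id=2,
)
catalog = Catalog()
catalog.commit("db.orders", meta)
```

In this sketch, `plan_scan(catalog, "db.orders")` resolves the current snapshot, while passing `snapshot_id=1` reads the older version. That is the intuition behind the consistency and transactionality points in the summary: readers see a whole snapshot or none of it, because a writer publishes new data only by committing new metadata through the catalog.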
