In this monologue podcast, Tim Berglund discusses Apache Iceberg, an open table format, and its evolution from data warehouses to data lakes. He explains the need for open table formats, focusing on how Iceberg addresses consistency, transactionality, and schema management challenges. Berglund details Iceberg's logical architecture, including data files, manifest files, manifest lists, metadata files, and catalogs, emphasizing its pluggable nature and application in modern streaming environments. He also introduces Confluent's Tableflow, which integrates Iceberg semantics with Kafka topics, enabling real-time data accessibility as Iceberg tables.
Sign in to continue reading, translating and more.
Continue