From Academia to Industry: Bridging Data Engineering Challenges

In this episode of the Data Engineering Podcast, Tobias Macey interviews Paul Groth, a professor at the University of Amsterdam, about his research on knowledge graphs and data engineering. They discuss the evolution and nuances of data provenance and lineage, the role of the Intelligent Data Engineering Lab in bridging the gap between academic research and industry practices, and the challenges of managing data models and access control. The conversation explores the impact of large language models (LLMs) on knowledge graph construction, data integration, and the broader data engineering ecosystem, including the potential for LLMs to serve as databases themselves. They also touch on the changing landscape of computer architecture, edge computing, data federation, and the differences between data management in research and business contexts, highlighting the need for flexible data integration and the importance of human-AI collaboration in data engineering pipelines.

Outlines

Part 1: Introduction and Academic Focus

Part 2: Knowledge Graphs and LLMs

Part 3: Architecture and Data Management

Part 4: Conclusion

Sign in to continue reading, translating and more.

Continue

Data Engineering Podcast

Part 1: Introduction and Academic Focus

Introduction to Knowledge Graphs and Data Engineering with Paul Groth

Academic Research in Data Engineering: Focus Areas and Challenges

Part 2: Knowledge Graphs and LLMs

Knowledge Graphs and Large Language Models: A Renewed Ecosystem

Modeling and Querying Knowledge Graphs: Unification and Computer Architecture

Part 3: Architecture and Data Management

Evolving Computer Architectures and the Role of Databases

Data Management in Research vs. Business: Challenges and Opportunities

Lessons Learned and Future Research Directions in Data Management

Part 4: Conclusion

Call to Action and Podcast Outro

From Academia to Industry: Bridging Data Engineering Challenges

Data Engineering Podcast

Part 1: Introduction and Academic Focus

00:11Introduction to Knowledge Graphs and Data Engineering with Paul Groth

Introduction to Knowledge Graphs and Data Engineering with Paul Groth

07:39Academic Research in Data Engineering: Focus Areas and Challenges

Academic Research in Data Engineering: Focus Areas and Challenges

Part 2: Knowledge Graphs and LLMs

14:00Knowledge Graphs and Large Language Models: A Renewed Ecosystem

Knowledge Graphs and Large Language Models: A Renewed Ecosystem

20:50Modeling and Querying Knowledge Graphs: Unification and Computer Architecture

Modeling and Querying Knowledge Graphs: Unification and Computer Architecture

Part 3: Architecture and Data Management

28:36Evolving Computer Architectures and the Role of Databases

Evolving Computer Architectures and the Role of Databases

35:36Data Management in Research vs. Business: Challenges and Opportunities

Data Management in Research vs. Business: Challenges and Opportunities

42:07Lessons Learned and Future Research Directions in Data Management

Lessons Learned and Future Research Directions in Data Management

Part 4: Conclusion

50:05Call to Action and Podcast Outro

Call to Action and Podcast Outro