The Data Lakehouse: Sorting Hype, Concept, and Reality
Wayne Eckerson
August 12, 2020
The emerging “data lakehouse” concept describes a hybrid data management environment that combines the characteristics of a data warehouse and a data lake. This report explores how enterprise and data architects approach the lakehouse concept, and the roles that lakehouses will play in modern enterprises.
Broadly speaking, the data lakehouse builds high-performance SQL data warehouse structures onto economical and flexible cloud data lake object storage. As with any collision of traditional and bleeding-edge technologies, the data lakehouse invites detractors as well as proponents.
Dave Wells examines the concept from an architect’s perspective. He outlines the pros and cons, while warning about the dangers of the “monolith.” Wayne Eckerson and Kevin Petrie square off with alternative perspectives. Wayne proposes that modern cloud data warehouses will dominate the market. Kevin believes new data lake technology will play a critical role in modern enterprise data environments.
Prefer to listen and watch? Replay our freewheeling Shop Talk discussion from April 17, 2020. Eckerson Group researchers and consultants plus guest Jason Nadeau of Dremio discuss the pros and cons of data lakehouses. The live audience also chimed in with valuable comments and questions. Show us your passion for this topic by commenting on our articles or sending us direct messages.