Feb. 18, 2024, 8 p.m. | Tobias Macey

Data Engineering Podcast www.dataengineeringpodcast.com

Summary


A data lakehouse is intended to combine the benefits of data lakes (cost-effective, scalable storage and compute) and data warehouses (a user-friendly SQL interface). Multiple open source projects and vendors have been working together to make this vision a reality. In this episode, Dain Sundstrom, CTO of Starburst, explains how the combination of the Trino query engine and the Iceberg table format offers the ease of use and execution speed of data warehouses with the infinite storage and …
