Purchase Required

You need to purchase this content in order to view it

The History of Data Lakes Lecture

Module
46 mins

Description

In this lecture, we explore the evolution of big data processing technologies, starting with Java MapReduce, the impact of Hive, and the rise of Spark. Zach shares experiences from adopting Scala and Spark at Airbnb, discuss the power of Iceberg in data engineering, and how object storage technologies like S3 have changed the game. We also look at the benefits of Iceberg, its role in data restoration, and why partitioning strategies are so important for data engineers.