Purchase Required

You need to purchase this content in order to view it

Apache Spark Shuffle Joins Day 1 Lab

Module
35 mins

Description

In this lab, Zach walks through the process of joining two datasets related to NBA games and players, focusing on the performance implications of different join strategies. He demonstrates how to disable broadcast joins and analyze the resulting execution plans to better understand the underlying processes. He also highlights the importance of using formatted modes for clarity in job execution.