Master data engineering and AI like a big tech engineer

Real Time Formula 1 Analytics

Big Bag Data

Chess.com Analytics

Trading Strategy for Crypto Currency - Analytics Engineering Capstone Submission

BetFlow - Real Time Sports Betting App

Showcase of student projects

Capstone Projects

Our students have gone on to work at companies like Meta, Airbnb and Amazon. As well as achieve 100% raises!

Fast Track Your Career

Immediate free cloud access to Databricks, AWS, Snowflake, Astronomer, and more!

Free Cloud Access with tons of hands on exercises

Weekly Guest Speaker Sessions

Why Choose DataExpert.io Academy?

Instructor

In this lab, Zach shows the students how to use Glue Job Runner and Iceberg to optimize the data processing. He goes over setting up the job, running Python functions, and using UDFs. He also demonstrates how to monitor the job and view the output table. Plus, he explains the benefits of using Iceberg for data compression and partitioning. [Recorded on May30th, 2024]

Advanced Spark (Day 2 Lab)

academy/2/course/305/spark-batch-day-2-lab-v4-transcript.json

Sign in to view content

Advanced Spark (Day 2 Lab)

Description