DataExpert

Spark, Databricks, Snowflake, Kafka, Flink — the most comprehensive data engineering curriculum on the internet.

Course syllabus

92 lessons • 57+ hours of content • 16 assignments

Kickoff
1
Bootcamp Kickoff
2
Boot Camp Database Setup
3
January 2025 Bootcamp Kickoff
4
Databricks Boot Camp Kickoff
5
Capstone Project Brainstorming
Week 1: Databricks Basics
1
Databricks Platform Overview Day 1 Lecture
2
Databricks Platform Overview Day 1 Lab
3
Introduction to Spark Day 2 Lecture
4
Introduction to Spark Day 2 Lab
5
Apache Spark Core Day 3 Lecture
6
Apache Spark Core Day 3 Lab
Week 2: Databricks & Advanced Spark
1
Apache Spark Shuffle Joins Day 1 Lecture
2
Apache Spark Memory Tuning, Partitioning Day 2 Lecture
3
Apache Spark Memory Tuning, Partitioning Day 2 Lab
4
Apache Spark Unit Testing Day 3 Lecture
5
Apache Spark Unit Testing Day 3 Lab
6
Setting Up CI/CD and Unit Testing in Databricks for Reliable Data Pipelines
7
Databricks and Advanced Spark Day 1 Lecture
8
Databricks and Advanced Spark Day 1 Lab
9
Databricks and Advanced Spark Day 2 Lecture
10
Databricks and Advanced Spark Day 2 Lab
11
Apache Spark Shuffle Joins Day 1 Lab
Week 3: Data Lakes with Delta Table
1
Delta Table Day 1 Lecture
2
Delta Table Day 1 Lab
3
Delta Lake Bonus
4
Delta Table Day 2 Lecture
5
Delta Table Day 2 Lab
6
Test Again
7
Data Lakes with Delta Table Day 1 Lecture
8
Data Lakes with Delta Table Day 1 Lab
9
Data Lakes with Delta Table Day 2 Lecture
10
Data Lakes with Delta Table Day 2 Lab
Week 4: Structured Streaming with Spark & Kafka
1
Apache Spark programming with Databricks Day 1 Lecture
2
Apache Spark programming with Databricks Day 1 Lab
3
Apache Spark programming with Databricks Day 2 Lecture
4
Apache Spark programming with Databricks Day 2 Lab
5
Structured Streaming Kafka to Delta Live Table Day 1 Lecture
6
Structured Streaming Kafka to Delta Live Table Day 1 Lab
7
Structured Streaming Kafka to Delta Live Table Day 2 Lecture
8
Structured Streaming Kafka to Delta Live Table Day 2 Lab
9
Exploring UDFs and SQL Benchmarks in Spark Streaming
10
Advanced Spark Optimization Techniques Day 1 Lecture
11
Advanced Spark Optimization Techniques Day 1 Lab
12
Spark Structured Streaming Day 2 Lecture
13
Spark Structured Streaming Day 2 Lab
14
Deep Dive On Workflows Day 3 Lecture
15
Deep Dive On Workflows Day 3 Lab
Week 5: Managing Unstructured Data
1
Managing Unstructured Data - Day 1 Lecture
2
Managing Unstructured Data - Day 1 Lab
3
Managing Unstructured Data - Day 2 Lecture
4
Managing Unstructured Data - Day 2 Lab
5
Managing Unstructured Data Day 1 Lecture
6
Managing Unstructured Data Day 1 Lab
7
Managing Unstructured Data Day 2 Lecture
8
Managing Unstructured Data Day 2 Lab
Week 6: Building AI Agents with Databricks
1
Building AI Agents with Databricks Day 1 Lecture
2
Building AI Agents with Databricks Day 1 Lab
3
Building AI Agents with Databricks Day 2 Lecture
4
Building AI Agents with Databricks Day 2 Lab
5
Enhancements and Implementations in MLflow
Capstone Project
1
Capstone May 2025
2
Capstone Showcase Jan 2025
Q&A with Zach
1
Q&A Week 1
2
Q&A Week 2
3
Q&A Week 3
4
Q&A Week 4
5
Q&A with Zach Week 1
6
Q&A with Zach Week 2
7
Q&A with Zach Week 3
8
Q&A with Zach Week 4
9
Q&A with Zach Week 5
10
Navigating Data Engineering: Tips, Tools, and Career Insights
11
Q&A with Zach Week 1
12
Q&A with Zach Week 2
13
Q&A with Zach Week 3
14
Q&A with Zach Week 4
15
Q&A with Zach Week 5
Expert Guest Sessions
1
Alex Merced - Head of DevRel at Dremio
2
Joe Reis - Author of Fundamentals of Data Engineering
3
Shubham Srivastava - Senior Data Engineer at Amazon (2025-06-24)
4
Vaishali Macwan - Senior Data Scientist at Amazon
5
Jason Reid - Co-founder at Tabular
6
Shubham Srivastava - Senior Data Engineer at Amazon (02-01-2026)
7
Brian Pulliam - Career Development Coach
8
Xinran Waibel - Data Engineer at OpenAI
9
Jason Reid - Co-founder at Tabular (2.0)
10
Shachar Meir - Data Advisor and Public Speaker
11
Brian Pulliam - Career Development Coach (2.0)
12
Sundas Khalid - Principal Analytics Lead at Google
13
Yuzheng Sun - Staff Data Scientist at Statsig
14
Prasad Rao - Principal Solutions Architect
15
Joe Reis - Author of Fundamentals of Data Engineering (2.0)