Databricks AI Data Engineer Boot Camp

Course syllabus

102 lessons • 62+ hours of content • 16 assignments

Mastering LTAP modeling and Lakebase (1 lesson)
Building Rich Context for AI Agents (1 lesson)
Mastering Apache Spark (1 lesson)
Deploying AI agents with Agent Bricks (1 lesson)
Real-Time Data with Delta Live Tables (1 lesson)

Also included

DataExpert (included)
Spark, Databricks, Snowflake, Kafka, Flink: the most comprehensive data engineering curriculum on the internet.
97 lessons

Kickoff

1. Bootcamp Kickoff
2. Boot Camp Database Setup
3. January 2025 Bootcamp Kickoff
4. Databricks Boot Camp Kickoff
5. Capstone Project Brainstorming

Week 1: Databricks Basics

1. Databricks Platform Overview Day 1 Lecture
2. Databricks Platform Overview Day 1 Lab
3. Introduction to Spark Day 2 Lecture
4. Introduction to Spark Day 2 Lab
5. Apache Spark Core Day 3 Lecture
6. Apache Spark Core Day 3 Lab

Week 2: Databricks & Advanced Spark

1. Apache Spark Shuffle Joins Day 1 Lecture
2. Apache Spark Memory Tuning, Partitioning Day 2 Lecture
3. Apache Spark Memory Tuning, Partitioning Day 2 Lab
4. Apache Spark Unit Testing Day 3 Lecture
5. Apache Spark Unit Testing Day 3 Lab
6. Setting Up CI/CD and Unit Testing in Databricks for Reliable Data Pipelines
7. Databricks and Advanced Spark Day 1 Lecture
8. Databricks and Advanced Spark Day 1 Lab
9. Databricks and Advanced Spark Day 2 Lecture
10. Databricks and Advanced Spark Day 2 Lab
11. Apache Spark Shuffle Joins Day 1 Lab

Week 3: Data Lakes with Delta Table

1. Delta Table Day 1 Lecture
2. Delta Table Day 1 Lab
3. Delta Lake Bonus
4. Delta Table Day 2 Lecture
5. Delta Table Day 2 Lab
6. Test Again
7. Data Lakes with Delta Table Day 1 Lecture
8. Data Lakes with Delta Table Day 1 Lab
9. Data Lakes with Delta Table Day 2 Lecture
10. Data Lakes with Delta Table Day 2 Lab

Week 4: Structured Streaming with Spark & Kafka

1. Apache Spark Programming with Databricks Day 1 Lecture
2. Apache Spark Programming with Databricks Day 1 Lab
3. Apache Spark Programming with Databricks Day 2 Lecture
4. Apache Spark Programming with Databricks Day 2 Lab
5. Structured Streaming Kafka to Delta Live Tables Day 1 Lecture
6. Structured Streaming Kafka to Delta Live Tables Day 1 Lab
7. Structured Streaming Kafka to Delta Live Tables Day 2 Lecture
8. Structured Streaming Kafka to Delta Live Tables Day 2 Lab
9. Exploring UDFs and SQL Benchmarks in Spark Streaming
10. Advanced Spark Optimization Techniques Day 1 Lecture
11. Advanced Spark Optimization Techniques Day 1 Lab
12. Spark Structured Streaming Day 2 Lecture
13. Spark Structured Streaming Day 2 Lab
14. Deep Dive On Workflows Day 3 Lecture
15. Deep Dive On Workflows Day 3 Lab

Week 5: Managing Unstructured Data

1. Managing Unstructured Data - Day 1 Lecture
2. Managing Unstructured Data - Day 1 Lab
3. Managing Unstructured Data - Day 2 Lecture
4. Managing Unstructured Data - Day 2 Lab
5. Managing Unstructured Data Day 1 Lecture
6. Managing Unstructured Data Day 1 Lab
7. Managing Unstructured Data Day 2 Lecture
8. Managing Unstructured Data Day 2 Lab

Week 6: Building AI Agents with Databricks

1. Building AI Agents with Databricks Day 1 Lecture
2. Building AI Agents with Databricks Day 1 Lab
3. Building AI Agents with Databricks Day 2 Lecture
4. Building AI Agents with Databricks Day 2 Lab
5. Enhancements and Implementations in MLflow

Capstone Project

1. Capstone May 2025
2. Capstone Showcase Jan 2025

Bonus - Azure

1. Azure - Week 1 (Copy)
2. Azure - Week 2 (Copy)
3. Azure - Week 3 (Copy)
4. Azure - Week 4 (Copy)
5. Azure - Week 5 (Copy)

Q&A with Zach

1. Q&A Week 1
2. Q&A Week 2
3. Q&A Week 3
4. Q&A Week 4
5. Q&A with Zach Week 1
6. Q&A with Zach Week 2
7. Q&A with Zach Week 3
8. Q&A with Zach Week 4
9. Q&A with Zach Week 5
10. Navigating Data Engineering - Tips, Tools, and Career Insights
11. Q&A with Zach Week 1
12. Q&A with Zach Week 2
13. Q&A with Zach Week 3
14. Q&A with Zach Week 4
15. Q&A with Zach Week 5

Expert Guest Sessions

1. Alex Merced - Head of DevRel at Dremio
2. Joe Reis - Author of Fundamentals of Data Engineering
3. Shubham Srivastava - Senior Data Engineer at Amazon (2025-06-24)
4. Vaishali Macwan - Senior Data Scientist at Amazon
5. Shubham Srivastava - Senior Data Engineer at Amazon (02-01-2026)
6. Jason Reid - Co-Founder at Tabular
7. Brian Pulliam - Career Development Coach
8. Xinran Waibel - Data Engineer at OpenAI
9. Jason Reid - Co-Founder at Tabular (2.0)
10. Shachar Meir - Data Advisor and Public Speaker
11. Brian Pulliam - Career Development Coach (2.0)
12. Sundas Khalid - Principal Analytics Lead at Google
13. Yuzheng Sun - Staff Data Scientist at Statsig
14. Prasad Rao - Principal Solutions Architect
15. Joe Reis - Author of Fundamentals of Data Engineering (2.0)

Platform Access Included

Discord
GitHub
AWS
Databricks
Anthropic
OpenAI