2025 New Year Data Engineering Boot Camp starting January 6th

Zach Wilson

Taught by Zach Wilson

Founder at DataExpert.io

What you'll learn

100+ Hours of Content
5 live Q&A sessions with Zach
10 speaker sessions from industry experts
Get Certified and find a mentor
Free Access to AWS,Astronomer,Spark,Trino
Manage tables with Iceberg and Snowflake
Manage real-time data with Kafka
Build an awesome portfolio!

Learn directly from the experts

Zach Wilson

Zach Wilson

Founder at DataExpert.io

I have led teams of data engineers and software engineers at Airbnb, Facebook, and Netflix. My next goal is to upskill as many data knowledge workers as I can!

Course syllabus

99 lessons • 66+ hours of content • 9 assignments

January 2025 Bootcamp Kickoff
1
January 2025 Bootcamp Kickoff
Airflow + Trino
1
Orchestration and Airflow Fundamentals Day 1 Lecture
2
Orchestration and Airflow Fundamentals Day 1 Lab
3
Hard Orchestration Lessons Day 2 Lecture
4
Hard Orchestration Lessons Day 2 Lab
5
Cumulative DAGs in Production Day 3 Lecture
6
Cumulative DAGs in Production Day 3 Lab
Snowflake + dbt Basics
1
Snowflake Basics Day 1 Lecture
2
Snowflake Basics Day 1 Lab
3
dbt Basics Day 2 Lecture
4
dbt Basics Day 2 Lab
5
dbt Basics Day 3 Lecture
6
dbt Basics Day 3 Lab
Databricks Basics
1
Databricks Platform Overview Day 1 Lecture
2
Databricks Platform Overview Day 1 Lab
3
Introduction to Spark Day 2 Lecture
4
Introduction to Spark Day 2 Lab
5
Apache Spark Core Day 3 Lecture
6
Apache Spark Core Day 3 Lab
Advanced Spark on Databricks
1
Apache Spark Shuffle Joins Day 1 Lecture
2
Apache Spark Shuffle Joins Day 1 Lab
3
Apache Spark Memory Turning, Partitioning Day 2 Lecture
4
Apache Spark Memory Turning, Partitioning Day 2 Lab
5
Apache Spark Unit Testing Day 3 Lecture
6
Apache Spark Unit Testing Day 3 Lab
Snowflake + Advanced dbt
1
Snowflake Lecture
2
Snowflake Lab
3
Advanced dbt Day 2 Lecture
4
Advanced dbt Day 2 Lab
5
Advanced dbt Day 3 Lecture
6
Advanced dbt Day 3 Lab
Analytical Patterns and Advanced SQL
1
Applying Analytical Patterns Day 1 Lecture
2
Applying Analytical Patterns Day 1 Lab
3
Advanced SQL Patterns Day 2 Lecture
4
Advanced SQL Patterns Day 2 Lab
5
Analytical Patterns Recognizing Business Value Day 3 Lecture
6
Analytical Patterns Recognizing Business Value Day 3 Lab
Real Time Data (Spark and Kafka Streaming)
1
Advanced Spark Optimization Techniques Day 1 Lecture
2
Advanced Spark Optimization Techniques Day 1 Lab
3
Spark Structured Streaming Day 2 Lecture
4
Spark Structured Streaming Day 2 Lab
5
Deep Dive On Workflows Day 3 Lecture
6
Deep Dive On Workflows Day 3 Lab
Bonus - LLMs
1
RAG and LLMs Day1 Lecture
2
RAG and LLMs Day1 Lab
3
RAG and LLMs Day2 Lecture
4
RAG and LLMs Day2 Lab
5
LLMs Day 3 Part1
6
LLMs Day3 Part2
Career Development Sessions
1
Career Development - LinkedIn Optimization
2
Career Development - Resume Review
3
Career Development - Interview Help
4
Career Development - Data Modeling Interview
5
Career Development - Strategic Networking
Q&A with Zach Wilson
1
Q&A Week 1
2
Q&A Week 2
3
Q&A Week 3
4
Q&A Week 4
Guest Speaker Sessions
1
Jason Reid (cofounder of Tabular)
2
Shachar Meir
3
Brian Pulliam
4
Sundas Khalid
5
YZ
6
Prasad Rao
7
Joe Reis
TA Office Hours
1
TA Office Hour 1
2
TA Office Hour 2
3
TA Office Hour 3
4
TA Office Hour 4
5
TA Office Hour 5
6
TA Office Hour 6
Capstone Showcase Jan 2025
1
Capstone Showcase Jan 2025

Also included

IncludedData Engineer Interview Skills
16 lessons

Interview Skills

1The SQL Interview
2The Data Modeling Interview
3The Data Architecture Interview
4The Behavioral Interview
5The Data Structures and Algorithms Interview
6Data Structures and Algorithms Interview
7Product Sense Interview
8Behavioral Interview
9Data Modeling Interview
10Live SQL Training January 25th, Window Functions and Common Table Expressions
11Live SQL Training January 15th
12Scala Dataset vs Dataframe API [Dec 15, 2023]

AI and LLM

1LLM-Driven Data Engineering Day 1 Lab
2LLM-Driven Data Engineering Day 1 Lecture
3LLM-Driven Data Engineering Day 2 Lab
4LLM-Driven Data Engineering Day 2 Lecture
IncludedBuilding Pipelines with Iceberg and Airflow
11 lessons

Data Modeling with Iceberg and Trino

1The History of Data Lakes Lecture
2Iceberg Partitioning and Metadata Exploration Lab
3Mastering Data Lake Architectures Lecture
4Apache Iceberg Day 2 Lab
5Apache Iceberg Day 3 Lecture
6Apache Iceberg Day 3 Lab

Airflow Pipelines with Iceberg

1Setting Up Airflow for Week1 for Mac
2Orchestration and Airflow Fundamentals Lecture
3Orchestration and Airflow Fundamentals Lab
4Apache Iceberg Data Contracts Lecture
5Apache Iceberg Data Contracts Lab

Before you join

Prerequisites

Proficiency in Python and SQL, at least 6 months of experience in both
Basic understanding of Docker, Flink, and Kafka

Platform Access Included

AWSAWS
AstronomerAstronomer
GitHubGitHub
DatabricksDatabricks
SnowflakeSnowflake