Managing Unstructured Data Day 1 Lab

Description

In this video, Eumar walks through four notebooks covering Unity Catalog, synthetic data generation, and parsing receipts with Databricks AI functions. He explains why PII must be handled carefully and how to process data in parallel with Spark. He demonstrates how to create schemas, tables, and volumes, and encourages viewers to follow along and try the exercises at home. He also highlights the Databricks AI DevKit for generating code and managing data.
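
As a rough illustration of the schema, table, and volume step, here is a minimal sketch that builds the Unity Catalog DDL statements as plain strings. It is not taken from the lab notebooks: the catalog, schema, table, and volume names and the table columns are hypothetical, and the helper functions are invented for this example. In a Databricks notebook, each statement could then be passed to `spark.sql(...)`.

```python
from typing import List


def unity_catalog_ddl(catalog: str, schema: str, table: str, volume: str) -> List[str]:
    """Build DDL for a schema, a table, and a volume in one Unity Catalog namespace."""
    namespace = f"{catalog}.{schema}"  # Unity Catalog uses catalog.schema.object names
    return [
        f"CREATE SCHEMA IF NOT EXISTS {namespace}",
        # Hypothetical columns for parsed receipts; the lab's real table may differ.
        f"CREATE TABLE IF NOT EXISTS {namespace}.{table} "
        "(receipt_id STRING, file_path STRING, parsed_text STRING)",
        f"CREATE VOLUME IF NOT EXISTS {namespace}.{volume}",
    ]


def volume_path(catalog: str, schema: str, volume: str) -> str:
    """Return the file path under which a Unity Catalog volume is exposed."""
    return f"/Volumes/{catalog}/{schema}/{volume}"


# Hypothetical names, chosen only for this sketch.
statements = unity_catalog_ddl("main", "unstructured_lab", "receipts", "raw_receipts")
for stmt in statements:
    print(stmt)  # in a notebook: spark.sql(stmt)

print(volume_path("main", "unstructured_lab", "raw_receipts"))
```

Keeping the statements as strings makes them easy to inspect before running them against a workspace, and the `IF NOT EXISTS` clauses make the setup safe to re-run.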