Data Governance

21 articles tagged with "Data Governance"

Top Tools for Data Lakehouse and Data Warehouse

Choose a lakehouse for unified SQL, ML, and streaming; use open formats and governance to avoid lock-in and control costs.

13 min read
Data Engineering
How to Monitor Security in Databricks Lakehouses

Use Unity Catalog, system tables, SAT, and SIEM integrations to monitor lakehouse security, detect threats, and automate response.

14 min read
Data Engineering
Snowflake for Data Retention: Best Practices

Set Time Travel, Fail-safe, storage tiers and lifecycle policies to balance compliance, recovery, and storage cost in Snowflake.

10 min read
Data Engineering
Managing Domain Events in Event-Driven Architectures

Treat domain events as versioned API contracts—design for consumers, use outbox/CDC for reliable delivery, and enforce clear ownership.

14 min read
Data Engineering
Real-Time Ad Campaign Optimization with AI

AI and streaming data enable instant bid, budget, and audience adjustments to cut CPA, boost ROAS, and maintain governance.

14 min read
AI Engineering
How to Troubleshoot Cloud Data Warehouse Issues

Diagnose root causes—connections, slow queries, storage, and security—and apply targeted fixes to cut costs and boost cloud data warehouse performance.

14 min read
Data Engineering
How to Build Scalable Data Quality Frameworks

Build a metadata-driven, automated data quality framework—prioritize critical data, automate validation, and monitor quality in real time.

15 min read
Data Engineering
5 Steps to Automate Data Profiling in Snowflake

Automate Snowflake data profiling with DMFs, tasks, streams and Snowsight; define metrics, store results, and monitor anomalies and costs.

19 min read
Data Engineering
Databricks Logging: Setup and Tips

Configure Python or Log4j logging in Databricks, centralize JSON logs to Unity Catalog or cloud storage, set retention and integrate monitoring.

10 min read
Data Engineering
Metadata-Driven Data Quality: How It Works

Use metadata, lineage, and AI to automate validation, catch errors early, and scale data quality across pipelines.

15 min read
Data Engineering
Databricks for Anomaly Detection in Data Pipelines

Build real-time anomaly detection pipelines in Databricks using Delta Live Tables, Unity Catalog, Isolation Forest models, and SQL alerts.

16 min read
Data Engineering
Soda vs. Great Expectations: Data Quality Tools

Compare Soda's SQL/YAML real-time monitoring and Great Expectations' Python validations to pick the best data quality tool for your team's workflow.

11 min read
Data Engineering