Introduction to Databricks Lakehouse
Gang Wang
Senior Data Scientist
$$
$$

$$
$$
| All-Purpose | Jobs Cluster | |
|---|---|---|
| Mode | Interactive | Automated |
| Management | Manual | Auto-terminates |
| Cost | Higher (idle time) | Cost-optimized |
$$
SELECT COUNT(*) FROM bronze_sales;
-- 1,247,832 rows
SELECT COUNT(*) FROM silver_sales;
-- 1,189,456 rows (nulls removed)
SELECT COUNT(*) FROM gold_daily_revenue;
-- 365 rows (daily aggregates)
$$
$$
$$

$$
databricks.yml Asset Bundlebundle validate before deploying$$
targets:
production:
workspace:
root_path: /Shared/production
resources:
jobs:
nightly_sales_etl:
schedule:
quartz_cron: "0 0 3 * * ?"
$$
Introduction to Databricks Lakehouse