Databricks Delta Live Tables (DLT) Cheat Sheet

Updated 2026-05-23

Next Topic: Databricks Notebooks Cheat Sheet

Delta Live Tables (DLT) — now called Lakeflow Spark Declarative Pipelines (SDP) — is Databricks' framework for building reliable batch and streaming ETL pipelines using a declarative approach in SQL or Python. Instead of writing imperative orchestration code, you declare what each table should contain and let the framework handle execution order, dependency resolution, retries, and infrastructure. The key mental model to carry into every DLT table is that a pipeline is a directed acyclic graph of datasets: changing any node's definition only requires updating that node and letting DLT recompute what's affected — you never manage the execution sequence yourself.

What This Cheat Sheet Covers

This topic spans 14 focused tables and 100 indexed concepts. Below is a complete table-by-table outline of this topic, spanning foundational concepts through advanced details.

Table 1: Core Dataset TypesTable 2: Python API Decorators and FunctionsTable 3: SQL Syntax for Pipeline DefinitionsTable 4: Expectations and Data QualityTable 5: Change Data Capture (CDC) — AUTO CDC APIsTable 6: Pipeline Modes and TriggersTable 7: Autoscaling and ComputeTable 8: Unity Catalog IntegrationTable 9: Incremental Refresh PatternsTable 10: Event Log and MonitoringTable 11: Flows and Multi-Source PatternsTable 12: Pipeline Configuration and ParametersTable 13: Error Handling and RecoveryTable 14: Lakeflow Spark Declarative Pipelines — Naming and Migration Context

Table 1: Core Dataset Types

Streaming tables and materialized views are the two primary building blocks of every DLT pipeline; understanding which to use — and why — is the first decision any pipeline author makes.

Type	Example	Description
Streaming Table	`@dp.table()` `def orders_bronze():` `return spark.readStream.table("raw_orders")`	• Processes append-only, incremental data • maintains a streaming checkpoint so only new rows are consumed each run
Materialized View	`@dp.materialized_view()` `def orders_agg():` `return spark.read.table("orders_silver").groupBy("date").agg(...)`	• Pre-computes and stores the result of a query • supports incremental refresh for eligible SQL operations and falls back to full recompute otherwise

Table 1: Core Dataset Types

Streaming tables and materialized views are the two primary building blocks of every DLT pipeline; understanding which to use — and why — is the first decision any pipeline author makes.

Type	Example	Description
Streaming Table	`@dp.table()` `def orders_bronze():` `return spark.readStream.table("raw_orders")`	• Processes append-only, incremental data • maintains a streaming checkpoint so only new rows are consumed each run
Materialized View	`@dp.materialized_view()` `def orders_agg():` `return spark.read.table("orders_silver").groupBy("date").agg(...)`	• Pre-computes and stores the result of a query • supports incremental refresh for eligible SQL operations and falls back to full recompute otherwise