Skip to main content

Menu

LEVEL 0
0/5 XP
HomeAboutTopicsPricingMy VaultStats

Categories

🤖 Artificial Intelligence
☁️ Cloud and Infrastructure
💾 Data and Databases
💼 Professional Skills
🎯 Programming and Development
🔒 Security and Networking
📚 Specialized Topics
HomeAboutTopicsPricingMy VaultStats
LEVEL 0
0/5 XP
GitHub
© 2026 CheatGrid™. All rights reserved.
Privacy PolicyTerms of UseAboutContact

Databricks Delta Live Tables (DLT) Cheat Sheet

Databricks Delta Live Tables (DLT) Cheat Sheet

Back to Data Engineering
Updated 2026-05-23
Next Topic: Databricks Notebooks Cheat Sheet

Delta Live Tables (DLT) — now called Lakeflow Spark Declarative Pipelines (SDP) — is Databricks' framework for building reliable batch and streaming ETL pipelines using a declarative approach in SQL or Python. Instead of writing imperative orchestration code, you declare what each table should contain and let the framework handle execution order, dependency resolution, retries, and infrastructure. The key mental model to carry into every DLT table is that a pipeline is a directed acyclic graph of datasets: changing any node's definition only requires updating that node and letting DLT recompute what's affected — you never manage the execution sequence yourself.

What This Cheat Sheet Covers

This topic spans 14 focused tables and 100 indexed concepts. Below is a complete table-by-table outline of this topic, spanning foundational concepts through advanced details.

Table 1: Core Dataset TypesTable 2: Python API Decorators and FunctionsTable 3: SQL Syntax for Pipeline DefinitionsTable 4: Expectations and Data QualityTable 5: Change Data Capture (CDC) — AUTO CDC APIsTable 6: Pipeline Modes and TriggersTable 7: Autoscaling and ComputeTable 8: Unity Catalog IntegrationTable 9: Incremental Refresh PatternsTable 10: Event Log and MonitoringTable 11: Flows and Multi-Source PatternsTable 12: Pipeline Configuration and ParametersTable 13: Error Handling and RecoveryTable 14: Lakeflow Spark Declarative Pipelines — Naming and Migration Context

Table 1: Core Dataset Types

Streaming tables and materialized views are the two primary building blocks of every DLT pipeline; understanding which to use — and why — is the first decision any pipeline author makes.

TypeExampleDescription
Streaming Table
@dp.table()
def orders_bronze():
return spark.readStream.table("raw_orders")
• Processes append-only, incremental data
• maintains a streaming checkpoint so only new rows are consumed each run
Materialized View
@dp.materialized_view()
def orders_agg():
return spark.read.table("orders_silver").groupBy("date").agg(...)
• Pre-computes and stores the result of a query
• supports incremental refresh for eligible SQL operations and falls back to full recompute otherwise

More in Data Engineering

  • Databricks Cheat Sheet
  • Databricks Notebooks Cheat Sheet
  • Airbyte Open-Source ELT Cheat Sheet
  • Azure Synapse Analytics Cheat Sheet
  • Data Wrangling Cheat Sheet
  • Great Expectations Data Quality Cheat Sheet
View all 61 topics in Data Engineering