Trino Cheat Sheet

Updated 2026-05-28

Trino is a distributed SQL query engine designed for interactive analytics on large datasets across heterogeneous data sources. It was originally created at Facebook as Presto; the community fork of that project (PrestoSQL) was renamed Trino in 2020. Trino supports federated queries: a single SQL statement can join data from multiple sources (databases, data lakes, object storage) without first moving the data. Its MPP (massively parallel processing) architecture separates compute from storage, which makes it a good fit for data lakehouse architectures. Key mental model: Trino doesn't store data. It is a query engine that coordinates distributed execution across worker nodes, pushing operations down to the data sources whenever possible and pulling only the necessary data into memory for processing.
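The mental model above (filter at the source, then join in the engine) can be illustrated with a small toy in plain Python. This is only a sketch: `ToySource`, `federated_join`, and the sample rows are hypothetical names invented for illustration and are not part of Trino or any Trino client library.

```python
class ToySource:
    """Hypothetical stand-in for an external data source (not a Trino API)."""

    def __init__(self, rows):
        self.rows = list(rows)
        self.rows_returned = 0  # counts rows that actually leave the source

    def scan(self, predicate=None):
        # The predicate is evaluated "inside" the source, which mimics pushdown:
        # only matching rows are handed to the engine.
        for row in self.rows:
            if predicate is None or predicate(row):
                self.rows_returned += 1
                yield row


# Two independent "sources", e.g. a data lake table and an operational database.
orders = ToySource(
    {"order_id": i, "customer_id": i % 3, "total": 10 * i} for i in range(1, 7)
)
customers = ToySource(
    [
        {"customer_id": 0, "name": "Ada"},
        {"customer_id": 1, "name": "Lin"},
        {"customer_id": 2, "name": "Sam"},
    ]
)


def federated_join(min_total):
    # Push the filter down to the orders source instead of filtering afterwards.
    big_orders = orders.scan(lambda row: row["total"] >= min_total)
    # Build an in-memory lookup from the smaller source, then probe it.
    names = {c["customer_id"]: c["name"] for c in customers.scan()}
    return [(o["order_id"], names[o["customer_id"]]) for o in big_orders]


result = federated_join(40)
print(result)
print("rows read from orders:", orders.rows_returned)
```

Only the three orders that pass the filter leave the `orders` source; the other three are never transferred, which is the effect pushdown aims for in a real deployment.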

What This Cheat Sheet Covers

This topic spans 26 focused tables and 189 indexed concepts. Below is a complete table-by-table outline of this topic, spanning foundational concepts through advanced details.

Table 1: Core Architecture Components
Table 2: Query Execution Model
Table 3: Connector Architecture
Table 4: Catalog Configuration
Table 5: SQL Dialect and Language Features
Table 6: Data Types
Table 7: Query Optimization Techniques
Table 8: Join Strategies
Table 9: Performance Tuning Properties
Table 10: Resource Management
Table 11: Memory Management
Table 12: Fault Tolerance
Table 13: Security and Access Control
Table 14: Deployment Modes
Table 15: Trino Gateway
Table 16: DML Operations
Table 17: EXPLAIN and Query Analysis
Table 18: Session Properties
Table 19: Monitoring and Observability
Table 20: Client Libraries and Tools
Table 21: Metadata and Information Schema
Table 22: User-Defined Functions
Table 23: Advanced SQL Features
Table 24: Data Formats and File Types
Table 25: Trino vs Alternatives
Table 26: Starburst Enterprise Extensions

Table 1: Core Architecture Components

Understanding each component's role is a prerequisite for tuning or deploying Trino; most performance issues and failure modes trace back to how these parts interact.

Component	Example	Description
Coordinator	Single node coordinating query execution	• Parses, analyzes, optimizes, and schedules queries • manages worker nodes and client connections • single point of failure without Trino Gateway or HA setup.
Worker	Multiple nodes executing query tasks	• Process data and execute tasks assigned by coordinator • fetch data from connectors and perform computation • horizontally scalable for increased throughput.
Connector	Hive, Iceberg, PostgreSQL, Kafka connectors	• Plugin that provides interface to specific data source • translates Trino operations to native source operations • enables data source abstraction.
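The division of labor in the table can be sketched as a toy model in plain Python: a connector exposes the source as independent chunks, workers compute partial results in parallel, and the coordinator schedules the work and merges the partials. All names here (`ListConnector`, `worker_task`, `coordinator_average`) are hypothetical and chosen for illustration; real Trino components are Java services and plugins, and a thread pool is only a stand-in for worker nodes.

```python
from concurrent.futures import ThreadPoolExecutor


class ListConnector:
    """Hypothetical connector: exposes a data source as independent chunks."""

    def __init__(self, values, split_size):
        self.values = list(values)
        self.split_size = split_size

    def splits(self):
        # Each chunk can be processed by a different worker.
        return [
            self.values[i:i + self.split_size]
            for i in range(0, len(self.values), self.split_size)
        ]


def worker_task(split):
    # A worker computes a partial aggregate over the chunk it was assigned.
    return sum(split), len(split)


def coordinator_average(connector, worker_count):
    # The coordinator plans the work, hands chunks to workers,
    # and merges the partial results into the final answer.
    splits = connector.splits()
    with ThreadPoolExecutor(max_workers=worker_count) as workers:
        partials = list(workers.map(worker_task, splits))
    total = sum(partial_sum for partial_sum, _ in partials)
    count = sum(partial_count for _, partial_count in partials)
    return total / count, len(splits)


average, split_count = coordinator_average(
    ListConnector(range(1, 101), split_size=25), worker_count=4
)
print(average, split_count)
```

Adding workers raises throughput only while there are enough chunks to keep them busy, which is one reason the way a connector divides its data matters for parallelism.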
