Amazon Redshift is a fully managed, petabyte-scale cloud data warehouse service developed by AWS that enables fast, cost-effective analytics on massive datasets using columnar storage and massively parallel processing (MPP) architecture. Designed for OLAP workloads, it seamlessly integrates with the broader AWS ecosystem—querying data lakes via Spectrum, streaming from Kinesis, and federating to RDS—while offering both provisioned clusters for predictable workloads and serverless for on-demand elasticity. A critical mental model: Redshift distributes data across compute nodes using distribution keys and sorts it using sort keys; choosing these wisely is the single most impactful optimization you can make, as poor choices force costly data shuffles across the network during query execution.
Share this article