Amazon SageMaker is AWS's fully managed end-to-end machine learning platform, covering every stage from data preparation and feature engineering through model training, tuning, deployment, and production monitoring. It eliminates the heavy lifting of infrastructure provisioning so practitioners can focus on model quality. The platform spans a wide spectrum β from no-code tools like Canvas and Autopilot, to low-level Python SDK primitives like Estimator and HyperparameterTuner β making it relevant whether you're a data scientist, ML engineer, or business analyst. The critical mental model is that SageMaker is not a single service but a suite of tightly integrated capabilities: data flows from processing β training β registry β endpoint, and MLOps tooling (Pipelines, Projects, Model Monitor) wraps that lifecycle for automation and governance.
What This Cheat Sheet Covers
This topic spans 19 focused tables and 141 indexed concepts. Below is a complete table-by-table outline of this topic, spanning foundational concepts through advanced details.
Table 1: SageMaker Studio and Development Environments
SageMaker Studio is the primary integrated development environment for the platform, providing a browser-based workspace where data scientists and engineers access notebooks, experiments, pipelines, and model governance in one place. Understanding the Studio architecture β domains, user profiles, and application types β is foundational for configuring team access and customizing compute environments.
| Feature | Example | Description |
|---|---|---|
Single browser IDE for SQL, notebooks, pipelines, MLflow | Unified IDE that consolidates data engineering, ML development, and generative AI workflows into one experience with access to all AWS analytics and AI services. | |
CreateDomain with VPC, auth mode, default settings | Top-level organizational unit that scopes a team's Studio environment; contains user profiles, apps, and shared resources within a VPC. | |
CreateUserProfile with per-user execution role | Per-user configuration within a domain that can override domain defaults for instance types, images, and execution roles. | |
Persistent workspace with kernel gateway app | Private or shared compute environment running JupyterLab; persists notebooks and installed packages between sessions. | |
Connect local VS Code / Cursor to SageMaker compute | Browser or local IDE experience powered by Code-OSS; supports local IDE connections via AWS Toolkit so you use your own editor against SageMaker compute. |