Supervised Learning Cheat Sheet

Updated 2026-04-28

🧠Study flashcards on this topic144 cards · spaced repetition→

Supervised learning is a machine learning paradigm where models learn from labeled training data to make predictions on unseen data. Each training example consists of an input-output pair, enabling the algorithm to learn a mapping function from inputs to outputs. The two primary tasks are classification (predicting discrete categories) and regression (predicting continuous values). The fundamental challenge lies in balancing the bias-variance tradeoff: simple models underfit (high bias, low variance), complex models overfit (low bias, high variance), and the goal is finding a model that generalizes well by minimizing total error on unseen data. Modern practice increasingly combines strong models with explainability tools like SHAP and LIME to meet regulatory and interpretability demands.

What This Cheat Sheet Covers

This topic spans 27 focused tables and 165 indexed concepts. Below is a complete table-by-table outline of this topic, spanning foundational concepts through advanced details.

Table 1: Learning Task TypesTable 2: Core Classification AlgorithmsTable 3: Core Regression AlgorithmsTable 4: Ensemble MethodsTable 5: Naive Bayes VariantsTable 6: SVM Kernel FunctionsTable 7: Decision Tree Splitting CriteriaTable 8: Loss FunctionsTable 9: Regularization TechniquesTable 10: Classification Evaluation MetricsTable 11: Regression Evaluation MetricsTable 12: Cross-Validation StrategiesTable 13: Data Splitting StrategiesTable 14: Feature PreprocessingTable 15: Feature Selection MethodsTable 16: Handling Imbalanced DataTable 17: Hyperparameter TuningTable 18: Multilabel Classification StrategiesTable 19: Multiclass Decomposition StrategiesTable 20: Missing Value ImputationTable 21: Probability CalibrationTable 22: Model ExplainabilityTable 23: Calibration Evaluation MetricsTable 24: Bias-Variance TradeoffTable 25: Learning DiagnosticsTable 26: Distance and Similarity MetricsTable 27: Key Hyperparameters by Algorithm

Quick IndexSubscribe to unlock

A jump-to index of every table row in this cheat sheet.

Mind MapSubscribe to unlock

An interactive map of every table and concept in this topic.

Table 1: Learning Task Types

Before reaching for any algorithm, you decide what shape the output takes — a single discrete label, a continuous number, several labels at once, or a ranked category. Naming the task correctly here dictates the loss function, the metrics, and which models in the rest of this sheet even apply.

Type	Example	Description
Binary Classification	`y_pred = model.predict(X) # [0, 1, 1, 0]`	• Assigns each instance to one of two classes • output is a discrete binary label (positive/negative, yes/no, spam/ham).
Multiclass Classification	`y_pred = model.predict(X) # [0, 2, 1, 3]`	• Assigns each instance to exactly one of multiple classes (3+) • mutually exclusive, single-label output.
Regression	`y_pred = model.predict([[1500]]) # 250000.0`	• Predicts continuous numerical values from input features • output is real-valued rather than discrete.

Table 1: Learning Task Types

Type	Example	Description
Binary Classification	`y_pred = model.predict(X) # [0, 1, 1, 0]`	• Assigns each instance to one of two classes • output is a discrete binary label (positive/negative, yes/no, spam/ham).
Multiclass Classification	`y_pred = model.predict(X) # [0, 2, 1, 3]`	• Assigns each instance to exactly one of multiple classes (3+) • mutually exclusive, single-label output.
Regression	`y_pred = model.predict([[1500]]) # 250000.0`	• Predicts continuous numerical values from input features • output is real-valued rather than discrete.