Statistics Cheat Sheet

Updated 2026-04-27

Next Topic: Statistics Fundamentals Cheat Sheet

🧠Study flashcards on this topic225 cards · spaced repetition→

Statistics is the science of collecting, analyzing, interpreting, and presenting data to extract meaningful insights and inform decision-making. Rooted in probability theory and mathematical principles, it serves as the foundation for data science, scientific research, business intelligence, and evidence-based policy. The field divides into descriptive statistics (summarizing and visualizing data) and inferential statistics (drawing conclusions about populations from samples). A crucial mental model: uncertainty is inherent in data—statistics provides rigorous frameworks to quantify that uncertainty, assess the reliability of findings, and distinguish signal from noise.

What This Cheat Sheet Covers

This topic spans 32 focused tables and 262 indexed concepts. Below is a complete table-by-table outline of this topic, spanning foundational concepts through advanced details.

Table 1: Measures of Central TendencyTable 2: Measures of DispersionTable 3: Probability FundamentalsTable 4: Probability DistributionsTable 5: Descriptive Statistics MeasuresTable 6: Data Types and ScalesTable 7: Sampling MethodsTable 8: Hypothesis Testing FundamentalsTable 9: Statistical Errors and PowerTable 10: Parametric TestsTable 11: Non-Parametric TestsTable 12: Post-Hoc TestsTable 13: Correlation and RegressionTable 14: Generalized Linear ModelsTable 15: Categorical Data AnalysisTable 16: Effect Size MeasuresTable 17: Statistical AssumptionsTable 18: Normality TestsTable 19: Outlier Detection MethodsTable 20: Experimental Design TypesTable 21: Regression Diagnostics and AssumptionsTable 22: Model Selection CriteriaTable 23: Multivariate Statistical MethodsTable 24: Survival Analysis MethodsTable 25: Mixed and Hierarchical ModelsTable 26: Resampling MethodsTable 27: Bayesian Statistics ConceptsTable 28: Causal Inference MethodsTable 29: Time Series AnalysisTable 30: Meta-Analysis MethodsTable 31: Missing Data MethodsTable 32: Data Visualization Types

Quick IndexSubscribe to unlock

A jump-to index of every table row in this cheat sheet.

Mind MapSubscribe to unlock

An interactive map of every table and concept in this topic.

Table 1: Measures of Central Tendency

These are the different ways to answer the deceptively simple question "what's a typical value?" Each measure pulls the center toward something different—the mean chases every value including outliers, the median holds steady in the middle, and the specialized means (geometric, harmonic, weighted) exist because averaging growth rates or speeds the naive way gives wrong answers.

Measure	Example	Description
Mean	`mean = sum(x) / n`	• Arithmetic average of all values • sensitive to outliers and best suited for symmetric distributions.
Median	`median = sorted(x)[n//2]`	• Middle value in sorted data • robust to outliers and preferred for skewed distributions.
Mode	`mode = most_frequent(x)`	• Most frequently occurring value • useful for categorical data and identifying peaks in distributions.
Weighted mean	`sum(x * w) / sum(w)`	Average where each value has an assigned weight reflecting its importance or frequency.

Table 1: Measures of Central Tendency

Measure	Example	Description
Mean	`mean = sum(x) / n`	• Arithmetic average of all values • sensitive to outliers and best suited for symmetric distributions.
Median	`median = sorted(x)[n//2]`	• Middle value in sorted data • robust to outliers and preferred for skewed distributions.
Mode	`mode = most_frequent(x)`	• Most frequently occurring value • useful for categorical data and identifying peaks in distributions.
Weighted mean	`sum(x * w) / sum(w)`	Average where each value has an assigned weight reflecting its importance or frequency.