Statistics is the mathematical science of collecting, organizing, analyzing, and interpreting numerical data to make informed decisions under uncertainty. It bridges probability theory (which models randomness) and practical data analysis (which extracts patterns from observations), forming the foundation for fields ranging from machine learning to clinical trials. Statistics divides into descriptive statistics (summarizing data you have) and inferential statistics (generalizing from samples to populations), each serving distinct but complementary roles. The key insight: variation is everywhere, and statistics gives us principled ways to measure it, understand it, and reason through it, transforming raw numbers into actionable knowledge.
What This Cheat Sheet Covers
This topic spans 27 focused tables and 196 indexed concepts. Below is a complete table-by-table outline, from foundational concepts through advanced details.
Table 1: Population vs Sample
| Concept | Example | Description |
|---|---|---|
| Population | all 10,000 employees | • Complete set of all individuals or observations of interest • typically too large to measure entirely. |
| Sample | random 100 employees | • Subset of the population actually measured • must be representative to support valid inference. |
| Parameter | μ = 65 (population mean) | • Fixed value describing a population • usually unknown • denoted by Greek letters (μ, σ, ρ). |
| Statistic | x̄ = 67 (sample mean) | • Calculated value from sample data used to estimate a parameter • denoted by Roman letters (x̄, s, r). |
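
The distinction between a parameter and a statistic can be illustrated with a short simulation. The sketch below is only illustrative: the population of 10,000 employee scores is synthetic (drawn from a normal distribution with mean 65 and standard deviation 10, both assumed for the example), and the variable names are hypothetical. It uses only Python's standard `random` and `statistics` modules.

```python
import random
import statistics

rng = random.Random(42)  # fixed seed so the sketch is repeatable

# Hypothetical population: one score for each of 10,000 employees.
# The normal(65, 10) shape is an assumption made for illustration only.
population = [rng.gauss(65, 10) for _ in range(10_000)]

# Parameters: fixed values describing the whole population.
# In practice these are usually unknown because the population is too large to measure.
mu = statistics.mean(population)
sigma = statistics.pstdev(population)  # population standard deviation (divides by N)

# Sample: a simple random sample of 100 employees, drawn without replacement.
sample = rng.sample(population, 100)

# Statistics: values computed from the sample to estimate the parameters.
x_bar = statistics.mean(sample)
s = statistics.stdev(sample)  # sample standard deviation (divides by n - 1)

print(f"parameters: mu = {mu:.2f}, sigma = {sigma:.2f}")
print(f"statistics: x_bar = {x_bar:.2f}, s = {s:.2f}")
```

Re-running with a different seed changes `x_bar` and `s` from sample to sample, while `mu` and `sigma` stay fixed for a given population. That sample-to-sample variation is what inferential statistics sets out to quantify.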