Descriptive statistics is the branch of statistics focused on summarizing and describing data using numerical measures and graphical representations. It forms the foundation of data analysis across fields from business intelligence to scientific research, providing the essential first step before any inferential work. Unlike inferential statistics which draws conclusions about populations, descriptive statistics simply characterizes the observed data itself. The key insight: nearly every dataset can be understood through three lensesβcenter (where most values cluster), spread (how widely values vary), and shape (how the distribution looks)βmaking these core concepts universally applicable regardless of domain.
What This Cheat Sheet Covers
This topic spans 15 focused tables and 107 indexed concepts. Below is a complete table-by-table outline of this topic, spanning foundational concepts through advanced details.
Table 1: Data Types and Measurement Levels
| Type | Example | Description |
|---|---|---|
colors = ['red', 'blue', 'green'] | β’ Categorical data with no inherent order β’ values are distinct labels that cannot be ranked (gender, country, product type). | |
satisfaction = ['low', 'medium', 'high'] | β’ Categorical data with meaningful order but unequal intervals β’ differences between levels are not quantifiable (education level, survey ratings). | |
temp_c = [0, 10, 20, 30] | β’ Numerical data with equal intervals but no true zero β’ ratios are meaningless (temperature in Celsius, calendar years). | |
height = [150, 165, 180] | β’ Numerical data with equal intervals and a true zero β’ all arithmetic operations including ratios are valid (height, weight, income, age). |