Prometheus is an open-source monitoring and alerting toolkit designed for reliability and scalability in cloud-native and containerized environments. Born at SoundCloud in 2012 and now a CNCF graduated project, Prometheus operates on a pull-based model, scraping metrics from instrumented targets at regular intervals and storing them in an efficient time-series database (TSDB). What makes Prometheus particularly powerful is PromQL, its functional query language that enables real-time aggregation and analysis—paired with a flexible alerting system that routes notifications through Alertmanager. With Prometheus 3.0 (November 2024), the project added UTF-8 metric names, native OTLP ingestion, Remote Write 2.0, and a redesigned UI; keep in mind that Prometheus is metrics-first, not logs or traces, and its dimensional label model demands careful attention to cardinality to avoid performance degradation.
What This Cheat Sheet Covers
This topic spans 26 focused tables and 201 indexed concepts. Below is a complete table-by-table outline of this topic, spanning foundational concepts through advanced details.
Table 1: Metric Types
| Type | Example | Description |
|---|---|---|
http_requests_total{method="GET"} 1523 | • Cumulative value that only increases or resets on restart • use rate() or increase() to derive per-second rates or totals. | |
node_memory_active_bytes 4.2e+09 | • Instantaneous value that can go up or down • represents current state like memory usage, queue depth, or temperature. | |
http_request_duration_seconds_bucket{le="0.5"} 124http_request_duration_seconds_sum 53.2http_request_duration_seconds_count 144 | • Samples observations into configurable buckets with _sum and _count• use histogram_quantile() for server-side approximate percentiles. |