Portal | Level: L2: Operations | Topics: Postmortems & SLOs | Domain: DevOps & Tooling
Postmortem & SLO/SLI - Skill Check¶
Mental model (bottom-up)¶
SLIs measure user experience. SLOs set targets. Error budgets create objective tradeoffs between reliability and velocity. Postmortems turn incidents into organizational learning.
Visual stack¶
[SLA ] business contract (99.5% or refund)
|
[SLO ] engineering target (99.9%)
|
[Error Budget ] 1 - SLO = allowed failure (0.1% = 43 min/month)
|
[SLI ] actual measurement (99.2% this month)
Glossary¶
- SLI - Service Level Indicator; metric measuring quality (e.g., success rate)
- SLO - Service Level Objective; target for an SLI (e.g., 99.9%)
- SLA - Service Level Agreement; contract with consequences
- error budget - allowed unreliability (1 - SLO)
- burn rate - how fast you're consuming error budget
- blameless postmortem - focuses on systems, not individuals
Core questions (easy -> hard)¶
- What's the difference between SLI, SLO, and SLA?
- SLI: measurement. SLO: target. SLA: contract. SLA < SLO < theoretical max.
- What is an error budget?
- 1 - SLO. For 99.9%: 0.1% = 43.2 min/month of allowed downtime.
- What makes a good postmortem?
- Blameless, specific root cause, timeline with UTC, owned action items, runbook link.
- Why use percentiles instead of averages for latency?
- Averages hide tail. p50=50ms + p99=5s = avg looks fine but 1% of users suffer.
- What is a burn rate alert?
- Alerts when error budget is consumed too fast. 14.4x = all budget in 1 hour.
- What happens when error budget is exhausted?
- Error budget policy: feature freeze, focus on reliability, mandatory postmortem.
Wiki Navigation¶
Prerequisites¶
- Postmortems & SLOs (Topic Pack, L2)
Related Content¶
- Devops Flashcards (CLI) (flashcard_deck, L1) — Postmortems & SLOs
- Postmortem & SLO Drills (Drill, L2) — Postmortems & SLOs
- Postmortem SLO Flashcards (CLI) (flashcard_deck, L1) — Postmortems & SLOs
- Postmortems & SLOs (Topic Pack, L2) — Postmortems & SLOs
- SRE Practices (Topic Pack, L2) — Postmortems & SLOs