Kubernetes Ops Domain
Domain guide: a curated path through all Kubernetes content, organized as a learning sequence from basics to production ops.
Learning Path
L0-L1: Core Kubernetes
- K8s Exercises (Quest Ladder) (CLI) - 50+ exercises across levels
- Runtime Lab: Readiness Probe (CLI) - `make lab LAB=lab-runtime-01`
- Runtime Lab: HPA Scaling (CLI) - `make lab LAB=lab-runtime-02`
- Runtime Lab: OOMKilled (CLI) - `make lab LAB=lab-runtime-08`
L2: Production Operations
- K8s Ops Topic Pack - node upgrades, PDBs, debugging
  - Primer | Street Ops | Drills | Skillcheck
- kubectl Debugging Cheatsheet
- kubectl Drills
L2-L3: Incident Response
- Incident Simulator (CLI) - 18 injectable failure scenarios (`make incident`)
- Investigation Engine (CLI) - guided debugging (`make investigate`)
- 13 Runbooks - structured triage guides
- Chaos Scripts (CLI) - safe, reversible failures
Runbooks (by failure type)
| Failure | Runbook |
|---|---|
| CrashLoopBackOff | crashloopbackoff.md |
| ImagePullBackOff | imagepullbackoff.md |
| OOMKilled | oomkilled.md |
| Readiness Probe Failed | readiness_probe_failed.md |
| DNS Resolution | dns_resolution.md |
| NetworkPolicy Block | networkpolicy_block.md |
| HPA Not Scaling | hpa_not_scaling.md |
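The table above can also be read as a simple lookup from failure type to runbook file. The following is a minimal sketch of that idea, not part of the project's tooling: the names `RUNBOOKS` and `find_runbook` are hypothetical, and only the failure labels and filenames are taken from the table. The directory the runbooks live in is not stated here, so the sketch returns bare filenames.

```python
from typing import Optional

# Failure type -> runbook filename, copied from the table above.
# The dictionary name and the helper below are illustrative assumptions.
RUNBOOKS = {
    "CrashLoopBackOff": "crashloopbackoff.md",
    "ImagePullBackOff": "imagepullbackoff.md",
    "OOMKilled": "oomkilled.md",
    "Readiness Probe Failed": "readiness_probe_failed.md",
    "DNS Resolution": "dns_resolution.md",
    "NetworkPolicy Block": "networkpolicy_block.md",
    "HPA Not Scaling": "hpa_not_scaling.md",
}


def _normalize(name: str) -> str:
    """Lowercase and drop everything except letters and digits."""
    return "".join(ch for ch in name.lower() if ch.isalnum())


# Index keyed by the normalized label so lookups ignore case and spacing.
_INDEX = {_normalize(label): filename for label, filename in RUNBOOKS.items()}


def find_runbook(failure: str) -> Optional[str]:
    """Return the runbook filename for a failure label, or None if unlisted."""
    return _INDEX.get(_normalize(failure))
```

For example, `find_runbook("crashloopbackoff")` and `find_runbook("CrashLoopBackOff")` both resolve to the same file, while a failure type that is not in the table returns `None`.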
Study & Review
- K8s Flashcards — browse K8s cards online
- K8s Quiz — self-test questions
- All Flashcard Decks — 196 decks, 5700+ cards
- Incident Scenarios — 11 incident response drills
Pages that link here
- Kubernetes - Skill Check
- Kubernetes Under the Covers
- Operational Runbooks
- Practical Kubernetes Ops - Street Ops
- Runbook: DNS Resolution Failure
- Runbook: HPA Not Scaling
- Runbook: ImagePullBackOff
- Runbook: NetworkPolicy Blocking Traffic
- Runbook: OOMKilled Container
- Runbook: Pod CrashLoopBackOff
- Runbook: Readiness Probe Failed
- kubectl Drills