Portal | Level: L2: Operations | Topics: Prometheus, Kubernetes Networking | Domain: DevOps & Tooling
Symptoms: Grafana Dashboard Empty, Prometheus Scrape Blocked by NetworkPolicy¶
| File | Purpose |
|---|---|
| symptoms.md | Problem statement and observable symptoms |
| questions.md | Diagnostic questions to work through |
| investigation.md | Full investigation path across domains |
| remediation.md | Cross-domain fix and verification |
| grading.md | Self-assessment rubric |
Cross-domain incident study — see cross-domain index for context.
Pages that link here¶
- API Gateways & Ingress - Primer
- Alerting Rules - Skill Check
- Alerting Rules Drills
- Capacity Planning - Primer
- Cilium & eBPF Networking - Primer
- Cross-Domain Incident Case Studies
- Grading Rubric
- Kubernetes Networking - Primer
- Kubernetes Services & Ingress - Primer
- Log Analysis & Alerting Rules (PromQL / LogQL) - Primer
- OpenTelemetry - Primer
- Ops Archaeology: The 5% That Can't Resolve
- Primer
- Primer
- Production Readiness Review: Answer Key
Wiki Navigation¶
Prerequisites¶
- Observability Deep Dive (Topic Pack, L2)
Related Content¶
- API Gateways & Ingress (Topic Pack, L2) — Kubernetes Networking
- Adversarial Interview Gauntlet (30 sequences) (Scenario, L2) — Prometheus
- Alerting Rules (Topic Pack, L2) — Prometheus
- Alerting Rules Drills (Drill, L2) — Prometheus
- Capacity Planning (Topic Pack, L2) — Prometheus
- Case Study: CNI Broken After Restart (Case Study, L2) — Kubernetes Networking
- Case Study: Canary Deploy Routing to Wrong Backend — Ingress Misconfigured (Case Study, L2) — Kubernetes Networking
- Case Study: CoreDNS Timeout Pod DNS (Case Study, L2) — Kubernetes Networking
- Case Study: Disk Full — Runaway Logs, Fix Is Loki Retention (Case Study, L2) — Prometheus
- Case Study: Service Mesh 503s — Envoy Misconfigured, RBAC Policy (Case Study, L2) — Kubernetes Networking