Interview Scenarios
DevOps/SRE interview scenarios tied to the runtime labs and runbooks in this repository. Each scenario simulates a real incident and includes:
- The prompt (what happened, what you see)
- The expected investigation path
- A strong answer
- Common traps
- Links to hands-on practice
How to Use
- Read the scenario prompt as if an interviewer just presented it
- Think through your investigation path before reading the answer
- Practice the commands on a live cluster using the linked runtime lab
- Review the linked runbook for a concise reference
Scenarios
Pages that link here
- Master Curriculum: 40 Weeks
- Operational Runbooks
- Scenario: 100% 503 Errors After Mesh Rollout
- Scenario: CI Failed Due to Vulnerability Scan
- Scenario: Cloud Cost Spike Investigation
- Scenario: Config Drift Detected in Production
- Scenario: Database Failover During Deployment
- Scenario: Deployment Stuck Progressing
- Scenario: Docker Container Won't Start in Production
- Scenario: GitOps Drift Causing Outage
- Scenario: HPA Not Scaling Under Load
- Scenario: Helm Upgrade Broke Prod — Recover Fast
- Scenario: Ingress Returns 404 Intermittently
- Scenario: Linux Server Running Slow
- Scenario: Logs Disappeared from Grafana Loki