Skip to content

Asset Registry Inventory

Every registered learning asset, grouped by domain and type.

This is a maintainer view

For browsing content, use the Content Hub instead. This page lists what is registered in assets.yaml, which is a subset of all content.

Navigation: Content Hub | By level | By topic | Search

Linux

Topic Packs

Asset Level Time
/proc Filesystem L2 3h
Advanced Bash for Ops L1 5h
Binary and Floats L1 2h
Cron & Job Scheduling L1 4h
DNF Package Manager L2 3h
Debian & Ubuntu Ecosystem L1 6h
Disk & Storage Ops L1 5h
Environment Variables L1 2h
Inodes L1 2h
Kernel Troubleshooting L3 6h
LPIC / LFCS Exam Preparation L2 30h
Linux Boot Process L1 3h
Linux Data Hoarding L2 4h
Linux Distribution Comparison L1 4h
Linux Kernel Tuning L2 4h
Linux Logging L1 4h
Linux Memory Management L1 4h
Linux Ops L0 5h
Linux Ops Storage L1 2h
Linux Ops Systemd L1 2h
Linux Performance Tuning L2 7h
Linux Signals & Process Control L1 3h
Linux Text Processing L1 3h
Linux Users & Permissions L1 4h
Mounts Filesystems L1 2h
Package Management L1 2h
Pipes & Redirection L1 3h
Process Management L1 4h
SSH Deep Dive L1 4h
Terminal Internals L1 2h
cgroups & Linux Namespaces L2 4h
eBPF & Modern Linux Observability L3 5h
iptables & nftables L1 4h
mergerfs L2 3h
perf Profiling L2 2h
rsync L1 3h
strace L1 2h
systemctl & journalctl Deep Dive L1 4h
tar & Compression L1 2h
tmux & screen L1 2h

Exercise Sets

Asset Level Time
Bash Exercises (Quest Ladder) (CLI) L0 8h

Runbooks

Asset Level Time
Runbook: Disk Full L1 15m
Runbook: High CPU (Runaway Process) L1 15m
Runbook: OOM Killer Activated L1 15m
Runbook: Systemd Service Crash Loop L1 15m
Runbook: Zombie Processes Accumulating L2 15m

Scenarios

Asset Level Time
Interview: Linux Server Slow L1 15m

Assessments

Asset Level Time
Skillcheck: Bash L0 30m
Skillcheck: Linux Fundamentals L0 30m

Drills

Asset Level Time
Linux Ops Drills L0 30m

References

Asset Level Time
Track: Foundations L0 20h

Deep Dives

Asset Level Time
Deep Dive: Linux Boot Sequence L2 45m
Deep Dive: Linux Filesystem Internals L2 45m
Deep Dive: Linux Memory Management L2 45m
Deep Dive: Linux Network Packet Flow L2 45m
Deep Dive: Linux Performance Debugging L2 45m
Deep Dive: Linux Process Scheduler L3 45m
Deep Dive: Systemd Architecture L2 45m
Deep Dive: Systemd Service Design Debugging and Hardening L2 45m
Deep Dive: Systemd Timers Journald Cgroups and Resource Control L2 45m
Deep Dive: Systemd Units Dependencies and Ordering L2 45m
Deep Dive: eBPF Explained L3 45m

Case Studys

Asset Level Time
Case Study: IPTables Blocking Unexpected L2 30m
Case Study: Inode Exhaustion L1 30m
Case Study: Kernel Soft Lockup L2 30m
Case Study: OOM Killer Events L2 30m
Case Study: Runaway Logs Fill Disk L1 30m
Case Study: SELinux Denying Service L1 30m
Case Study: Stuck NFS Mount L2 30m
Case Study: Systemd Service Flapping L1 30m
Case Study: Time Sync Skew Breaks App L2 30m
Case Study: Zombie Processes Accumulating L1 30m

Networking

Topic Packs

Asset Level Time
ARP L1 2h
BGP EVPN / VXLAN L2 5h
Cisco Fundamentals for DevOps L1 4h
DHCP & IP Address Management L1 4h
DNS Deep Dive L1 4h
DNS Operations L2 4h
DNSSEC & DNS Security L2 4h
Email Infrastructure L1 3h
Firewalls L1 2h
HAProxy & Nginx for Ops L2 4h
HTTP Protocol L0 3h
LACP L1 2h
MTU L1 2h
Mellanox Switches L2 4h
NAT L1 2h
Network Automation L2 5h
Networking Deep Dive L1 6h
Networking Troubleshooting L1 5h
Routing L1 2h
STP (Spanning Tree) L1 2h
Subnetting & IP Addressing L1 2h
TCP/IP Deep Dive L2 5h
Tailscale & Zero Trust Networking L2 3h
VLANs L1 2h
VPN & Tunneling L2 5h
Wireshark & Packet Analysis L2 4h
gRPC & Protocol Buffers L2 3h

Runbooks

Asset Level Time
Runbook: DNS Resolution Failure L1 15m
Runbook: Load Balancer Health Check Failure L2 15m
Runbook: MTU Mismatch L2 15m
Runbook: Network Partition (Split Brain / Partial Connectivity) L2 20m
Runbook: TLS Certificate Expiry L2 15m

Scenarios

Asset Level Time
Scenario: Asymmetric Routing L2 20m
Scenario: DNS Looks Fine but App Fails L2 20m
Scenario: Duplex Mismatch L1 20m
Scenario: MTU Blackhole L2 20m
Scenario: VLAN Trunk Mismatch L2 20m

Assessments

Asset Level Time
Skillcheck: Networking Fundamentals L1 30m

Drills

Asset Level Time
Networking Drills L1 30m

Deep Dives

Asset Level Time
Deep Dive: AWS VPC Internals L2 45m
Deep Dive: TCP/IP Deep Dive L2 45m
Deep Dive: TLS Handshake L2 45m

Case Studys

Asset Level Time
Case Study: ARP Flux Duplicate IP L2 30m
Case Study: Asymmetric Routing One Direction L2 30m
Case Study: BGP Peer Flapping L2 30m
Case Study: DHCP Relay Broken L1 30m
Case Study: DNS Resolution Slow L1 30m
Case Study: DNS Split Horizon Confusion L2 30m
Case Study: Duplex Mismatch Symptoms L1 30m
Case Study: Firewall Shadow Rule L2 30m
Case Study: Jumbo Frames Partial L2 30m
Case Study: LACP Mismatch One Link Hot L2 30m
Case Study: MTU Blackhole TLS Stalls L2 30m
Case Study: Multicast Not Crossing Router L2 30m
Case Study: NAT Exhaustion Intermittent L2 30m
Case Study: Network Loop Broadcast Storm L2 30m
Case Study: OSPF Stuck In Exstart L2 30m
Case Study: Proxy ARP Causing Issues L2 30m
Case Study: SSL Cert Chain Incomplete L1 30m
Case Study: Source Routing Policy Miss L2 30m
Case Study: TCP RST After Idle L2 30m
Case Study: VLAN Trunk Mistag L1 30m

Datacenter & Hardware

Topic Packs

Asset Level Time
Bare-Metal Provisioning L2 5h
Datacenter & Server Hardware L1 6h
Dell PowerEdge Servers L1 2h
Firmware L1 2h
IPMI and ipmitool L1 3h
Packer L1 2h
Power L1 2h
Redfish API L1 2h
Server Hardware L1 2h
Storage Operations L2 5h
VMware L2 4h
Virtualization L2 5h

Scenarios

Asset Level Time
Interview: Server Won't POST L2 15m
Scenario: NIC Flapping / LACP Mismatch L2 20m
Scenario: OOB Unreachable but Host Responds L2 20m
Scenario: RAID Array Degraded L1 20m
Scenario: Server Won't Boot After Update L1 20m
Scenario: Thermal Throttling L1 20m

Assessments

Asset Level Time
Skillcheck: Datacenter L1 30m

Drills

Asset Level Time
Datacenter Drills L1 30m

Deep Dives

Asset Level Time
Deep Dive: Dell Linux PowerEdge L2 45m
Deep Dive: RAID and Storage Internals L2 45m

Case Studys

Asset Level Time
Case Study: BIOS Settings Reset After CMOS L1 30m
Case Study: BMC Clock Skew Cert Failure L2 30m
Case Study: Bonding Failover Not Working L1 30m
Case Study: Cable Management Wrong Port L1 30m
Case Study: Disk Full Root Services Down L1 30m
Case Study: Firmware Update Boot Loop L2 30m
Case Study: HBA Firmware Mismatch L2 30m
Case Study: Link Flaps Bad Optic L1 30m
Case Study: Memory ECC Errors Increasing L1 30m
Case Study: NVMe Drive Disappeared L2 30m
Case Study: OS Install Fails RAID Controller L2 30m
Case Study: PXE Boot Fails UEFI Mismatch L1 30m
Case Study: Power Supply Redundancy Lost L1 30m
Case Study: RAID Degraded Rebuild Latency L2 30m
Case Study: Rack PDU Overload Alert L1 30m
Case Study: Serial Console Garbled L1 30m
Case Study: Server Intermittent Reboot L2 30m
Case Study: Server Remote Console Lag L1 30m
Case Study: Thermal Throttle Fan Failure L1 30m
Case Study: iDRAC Unreachable OS Up L1 30m

Kubernetes

Topic Packs

Asset Level Time
API Gateways & Ingress L2 5h
Argo Workflows L2 4h
Cilium & eBPF Networking L2 4h
Container Images L1 3h
CrashLoopBackOff L1 2h
Database Operations on Kubernetes L2 4h
Docker L1 2h
Envoy Proxy L2 4h
HashiCorp Consul L2 4h
Helm L1 2h
Istio Service Mesh L2 5h
K8s Ecosystem L0 6h
K8s Networking L1 2h
K8s RBAC L1 2h
K8s Storage L1 2h
Kubernetes Concept Chain L0 2h
Kubernetes Debugging Playbook L2 4h
Kubernetes Node Lifecycle L2 4h
Kubernetes Ops (Production) L2 5h
Kubernetes Pods & Scheduling L1 4h
Kubernetes Services & Ingress L1 4h
Kustomize L1 3h
Multi-Tenancy Patterns L2 5h
Node Maintenance L1 2h
OOMKilled L1 2h
Policy Engines (OPA / Kyverno) L2 4h
Progressive Delivery L2 4h
Service Mesh L3 5h
cert-manager L1 3h
etcd L1 2h

Exercise Sets

Asset Level Time
Chaos Engineering Scripts (CLI) L2 2h
Docker Exercises (Quest Ladder) (CLI) L0 8h
Incident Simulator (18 scenarios) (CLI) L2 4h
Investigation Engine (CLI) L2 2h
Kubernetes Exercises (Quest Ladder) (CLI) L1 10h

Labs

Asset Level Time
Lab: HPA Live Scaling (CLI) L1 30m
Lab: Readiness Probe Failure (CLI) L1 30m
Lab: Resource Limits OOMKilled (CLI) L1 30m

Runbooks

Asset Level Time
Runbook: Deployment Stuck / Rollout Stalled L1 15m
Runbook: Disaster Recovery L2 20m
Runbook: HPA Not Scaling L2 15m
Runbook: HPA Thrashing (Rapid Scale Up/Down) L2 15m
Runbook: ImagePullBackOff L1 15m
Runbook: Ingress 404 L1 15m
Runbook: Ingress 502 Bad Gateway L2 15m
Runbook: Istio 503 Errors L3 15m
Runbook: Kyverno Blocking Workloads L2 15m
Runbook: NetworkPolicy Block L2 15m
Runbook: Node NotReady L1 15m
Runbook: OOMKilled Container L1 15m
Runbook: PVC Stuck in Pending L1 15m
Runbook: Pod CrashLoopBackOff L1 15m
Runbook: Pod Eviction L2 15m
Runbook: RBAC Forbidden L2 15m
Runbook: Readiness Probe Failed L1 15m
Runbook: Velero Backup & Restore L2 15m
Runbook: etcd Backup & Restore L2 20m
Runbook: etcd High Latency / Slow Operations L3 20m

Scenarios

Asset Level Time
Interview: Database Failover During Deploy L3 15m
Interview: Deployment Stuck Progressing L2 15m
Interview: Docker Container Debugging L1 15m
Interview: HPA Not Scaling L2 15m
Interview: Ingress 404 L2 15m
Interview: Kyverno Blocking Deploys L2 15m
Interview: Pods OOMKilled L2 15m
Interview: RBAC Forbidden L2 15m
Interview: Service Mesh 503s L3 15m
Interview: etcd Space Exceeded L3 15m
Scenario: etcd Troubleshooting L3 30m

Assessments

Asset Level Time
Skillcheck: Container Runtime Debug L2 30m
Skillcheck: Database Ops L2 30m
Skillcheck: Docker L0 30m
Skillcheck: Kubernetes L1 45m
Skillcheck: Kubernetes Operators L3 30m
Skillcheck: Kubernetes Under the Covers L2 45m
Skillcheck: Policy Engines L2 30m
Skillcheck: Service Mesh L3 30m
Skillcheck: etcd L2 30m

Drills

Asset Level Time
Container Runtime Drills L2 30m
Database Ops Drills L2 30m
Docker Drills L1 30m
Kubernetes Operators Drills L3 30m
Policy Engine Drills L2 30m
Service Mesh Drills L3 30m
etcd Drills L2 30m
kubectl Drills L1 45m

References

Asset Level Time
Track: Containers L0 10h
Track: Kubernetes Core L1 15h
kubectl Debugging Cheatsheet L1 15m

Deep Dives

Asset Level Time
Deep Dive: Kubernetes Networking L2 45m
Deep Dive: Kubernetes Pod Lifecycle L2 45m
Deep Dive: Kubernetes Scheduler L3 45m

Case Studys

Asset Level Time
Case Study: CNI Broken After Restart L2 30m
Case Study: CoreDNS Timeout Pod DNS L2 30m
Case Study: CrashLoopBackOff No Logs L1 30m
Case Study: DaemonSet Blocks Eviction L2 30m
Case Study: Drain Blocked by PDB L2 30m
Case Study: ImagePullBackOff Registry Auth L1 30m
Case Study: Node Pressure Evictions L2 30m
Case Study: Persistent Volume Stuck Terminating L2 30m
Case Study: Resource Quota Blocking Deploy L2 30m
Case Study: Service No Endpoints L1 30m

DevOps & Tooling

Topic Packs

Asset Level Time
AI Tools for DevOps L1 4h
Ansible Automation L1 5h
Ansible Deep Dive L2 5h
ArgoCD & GitOps L2 5h
Automation L1 1h
Backstage & Developer Portals L2 3h
CI/CD Pipelines & Patterns L1 8h
CSS Fundamentals L0 2h
Capacity Planning L2 4h
Career Engineering for Ops People L0 4h
Ceph Storage L2 5h
Certificates L2 1h
Change Management L1 4h
Chaos Engineering & Fault Injection L2 5h
Claude Code L1 2h
Cloud Ops Basics L1 4h
Configuration Management L1 1h
Containers Deep Dive L1 5h
Corporate IT Fluency for Engineers L0 3h
Crossplane L2 3h
DORA Metrics & DevEx L1 3h
Dagger / CI as Code L2 2h
Data Modeling L2 1h
Database Internals L1 2h
Databases L2 1h
Debugging Methodology L1 4h
Deployments L1 1h
Distributed Systems Fundamentals L2 5h
Edge & IoT Infrastructure L2 4h
Elasticsearch L1 2h
Feature Flags L1 3h
FinOps & Cost Optimization L2 4h
Fleet Operations at Scale L2 5h
Git Advanced L2 5h
Git Workflows & Branching Strategies L2 3h
Git for DevOps L0 3h
GitHub Actions L1 4h
GitOps L1 2h
GraphQL L2 4h
Homelab & Learning Infrastructure L0 5h
Incident Command & On-Call L2 4h
Incident Triage L1 2h
Infrastructure Testing L2 4h
Kafka L1 2h
Legacy System Archaeology L1 4h
Load Testing L1 3h
Make & Build Systems L1 3h
Mental Models (Core Concepts) L0 4h
Message Queues L2 5h
MongoDB Operations L1 4h
MySQL / MariaDB Operations L1 4h
Nginx & Web Servers L1 5h
Nix / NixOS L2 4h
On-Call L2 1h
OpenTofu & Terraform Ecosystem L2 2h
Ops War Stories & Pattern Recognition L2 4h
Performance L2 1h
Platform Engineering Patterns L2 5h
PostgreSQL Operations L2 4h
Postmortems & SLOs L2 4h
PowerShell L1 2h
Pulumi L2 3h
Python Async & Concurrency L2 4h
Python Debugging L1 3h
Python Packaging L2 3h
Python for Infrastructure L1 5h
RHCE (EX294) Exam Preparation L2 40h
RabbitMQ & Message Queues L2 3h
Redis Operations L2 3h
Reliability Patterns L2 1h
Runbook Craft L1 4h
S3-Compatible Object Storage L1 3h
SQL Fundamentals L0 2h
SQLite Operations & Internals L2 3h
SRE Practices L2 5h
Systems Thinking for Engineers L1 4h
Terraform / IaC L1 5h
Terraform Deep Dive L2 5h
The Ops of AI/ML Workloads L2 4h
The Psychology of Incidents L2 4h
Toil Reduction L2 1h
VS Code for DevOps L0 3h
Vendor Management & Escalation L1 3h
WebAssembly for Infrastructure L3 3h

Exercise Sets

Asset Level Time
Ansible Exercises (Quest Ladder) (CLI) L1 8h
Python Exercises (Quest Ladder) (CLI) L0 8h

Labs

Asset Level Time
Ansible Lab: Conditionals and Loops L1 25m
Ansible Lab: Facts and Variables L0 20m
Ansible Lab: Install Nginx (Idempotency) L1 20m
Ansible Lab: Ping and Debug L0 15m
Ansible Lab: Roles L1 20m
Ansible Lab: Templates and Handlers L1 25m
Ansible Lab: Vault (Secrets Management) L2 25m
Lab: GitOps Sync and Drift (CLI) L2 30m
Lab: Helm Upgrade Rollback (CLI) L1 30m

Runbooks

Asset Level Time
Runbook: Ansible Playbook Failure L1 15m
Runbook: ArgoCD Out of Sync L2 15m
Runbook: Build Failure Triage L1 15m
Runbook: Container Registry Pull Failure L1 15m
Runbook: Deploy Rollback L1 15m
Runbook: Helm Upgrade Failed L1 15m
Runbook: Long-Running Query / Lock Contention L2 15m
Runbook: Pipeline Stuck / Hung Job L1 15m
Runbook: PostgreSQL Connection Exhaustion L2 15m
Runbook: PostgreSQL Disk Space Critical L2 15m
Runbook: PostgreSQL Replication Lag L2 15m
Runbook: Terraform Drift Detection Response L2 15m
Runbook: Terraform State Lock Stuck L2 15m

Scenarios

Asset Level Time
Adversarial Interview Gauntlet (30 sequences) L2 15h
Break/Fix: Handler Name Mismatch L1 10m
Break/Fix: Jinja2 Syntax Error in Template L1 15m
Break/Fix: Privilege Escalation Missing L1 10m
Break/Fix: Task Ordering / Dependency L1 15m
Break/Fix: Undefined Variable + Bare Jinja2 L1 15m
Break/Fix: Wrong Host Scope L1 10m
Break/Fix: Wrong Module Parameter L0 10m
Break/Fix: YAML Indentation Error L0 10m
Interview: Config Drift Detected L2 15m
Interview: Cost Spike Investigation L2 15m
Interview: GitOps Drift Detected L2 15m
Interview: Helm Upgrade Broke Prod L2 15m

Assessments

Asset Level Time
Skillcheck: Ansible L1 30m
Skillcheck: CI/CD L1 30m
Skillcheck: Cloud Basics L1 30m
Skillcheck: DevOps Roadmap (Expanded) L1 45m
Skillcheck: FinOps L2 30m
Skillcheck: Git L0 30m
Skillcheck: GitOps L2 30m
Skillcheck: Helm & Release Ops L1 30m
Skillcheck: Postmortems & SLOs L2 30m
Skillcheck: Python Automation L0 30m
Skillcheck: Terraform / IaC L1 30m

Drills

Asset Level Time
Ansible Drills L1 30m
CI/CD Drills L1 30m
FinOps Drills L2 30m
Git Drills L0 30m
GitOps & ArgoCD Drills L2 30m
Helm Drills L1 30m
Postmortem & SLO Drills L2 30m
Python Drills L0 30m
Terraform Drills L1 30m

References

Asset Level Time
AI-Assisted DevOps Cookbook L1 2h
CI Pipeline Documentation L1 15m
DevOps Learning Roadmap L0 15m
Mental-Model-First Learning Guide L0 30m
Topic Tag Cloud & Content Index L0 10m
Track: Helm & Release Ops L1 8h
Track: Incident Response L2 10h
Track: Infrastructure L1 12h

Deep Dives

Asset Level Time
Deep Dive: CI/CD Pipeline Architecture L2 45m
Deep Dive: Containers How They Really Work L2 45m
Deep Dive: Docker Image Internals L2 45m
Deep Dive: Terraform State Internals L2 45m

Case Studys

Asset Level Time
Case Study: API Latency Spike — BGP Route Leak, Fix Is Network ACL L2 30m
Case Study: Alert Storm — Flapping Health Checks L2 30m
Case Study: Ansible Playbook Hangs — SSH Agent Forwarding Blocked by Firewall L2 30m
Case Study: Backup Job Failing — iSCSI Target Unreachable, VLAN Misconfigured L2 30m
Case Study: CI Pipeline Fails — Docker Layer Cache Corruption L2 30m
Case Study: Canary Deploy Routing to Wrong Backend — Ingress Misconfigured L2 30m
Case Study: Container Vuln Scanner False Positive Blocks Deploy L2 30m
Case Study: DNS Looks Broken — TLS Expired, Fix Is Cert-Manager L2 30m
Case Study: Database Replication Lag — Root Cause Is RAID Degradation L2 30m
Case Study: Deployment Stuck — ImagePull Auth Failure, Vault Secret Rotation L2 30m
Case Study: Disk Full — Runaway Logs, Fix Is Loki Retention L2 30m
Case Study: Grafana Dashboard Empty — Prometheus Blocked by NetworkPolicy L2 30m
Case Study: HPA Flapping — Metrics Server Clock Skew, Fix Is NTP L2 30m
Case Study: Job Queue Backlog — Worker Pod CPU Throttled by cgroup L2 30m
Case Study: Node NotReady — NIC Firmware Bug, Fix Is Ansible Playbook L2 30m
Case Study: Pod OOMKilled — Memory Leak in Sidecar, Fix Is Helm Values L2 30m
Case Study: SSH Timeout — MTU Mismatch, Fix Is Terraform Variable L2 30m
Case Study: Service Mesh 503s — Envoy Misconfigured, RBAC Policy L2 30m
Case Study: Terraform Apply Fails — State Lock Stuck, DynamoDB Throttle L2 30m
Case Study: User Auth Failing — OIDC Cert Expired, Cloud KMS Rotation L2 30m
Ops Archaeology: The 5% That Can't Resolve L2 30m
Ops Archaeology: The Alerts That Stopped Firing L2 30m
Ops Archaeology: The Certificate That Works Sometimes L2 30m
Ops Archaeology: The Cluster That Disagrees With Itself L2 30m
Ops Archaeology: The Container That Exits Immediately L1 30m
Ops Archaeology: The DR That Looks Ready But Isn't L2 30m
Ops Archaeology: The Deploy That Didn't Deploy L1 30m
Ops Archaeology: The Gateway That Returns 502 L1 30m
Ops Archaeology: The Job That Succeeded Wrong L2 30m
Ops Archaeology: The Pods That Won't Schedule L1 30m
Ops Archaeology: The Replica That Fell Behind L2 30m
Ops Archaeology: The Requests That Vanish L2 30m
Ops Archaeology: The Service That Won't Start L1 30m
Ops Archaeology: The Session Store That Keeps Dying L2 30m
Ops Archaeology: The Slow Death Nobody Noticed L2 30m

CLI Tools

Topic Packs

Asset Level Time
Modern CLI Tools L0 3h
Modern Cli Workflows L1 1h
Regex & Text Wrangling L1 5h
YAML, JSON & Config Formats L1 3h
awk: The Record/Field Processor L1 2h
curl & wget L1 3h
fd L1 2h
find L1 3h
fzf L1 2h
grep & Regular Expressions L1 4h
jq L1 2h
ripgrep L1 2h
sed: The Stream Editor L1 2h
xargs L1 3h

Assessments

Asset Level Time
Skillcheck: Modern CLI Tools L0 30m

Drills

Asset Level Time
Modern CLI Drills L0 30m

Observability

Topic Packs

Asset Level Time
Alerting Rules L2 4h
Continuous Profiling L2 4h
Log Pipelines L2 5h
Monitoring Fundamentals L1 5h
Monitoring Migration (Legacy to Modern) L2 5h
Observability Deep Dive L2 6h
OpenTelemetry L2 5h
Prometheus Deep Dive L2 5h
SLO Tooling L2 4h
Synthetic Monitoring L1 3h
Tracing L1 2h

Labs

Asset Level Time
Lab: Loki No Logs (CLI) L2 30m
Lab: Prometheus Target Down (CLI) L2 30m

Runbooks

Asset Level Time
Runbook: Alert Storm (Flapping / Too Many Alerts) L2 15m
Runbook: Grafana Dashboard Blank / No Data L1 15m
Runbook: Log Pipeline Backpressure / Logs Not Appearing L2 15m
Runbook: Loki No Logs L2 15m
Runbook: Prometheus Target Down L1 15m
Runbook: Tempo No Traces L2 15m

Scenarios

Asset Level Time
Interview: Loki Logs Disappeared L2 15m
Interview: Prometheus Target Down L2 15m

Assessments

Asset Level Time
Skillcheck: Alerting Rules L2 30m
Skillcheck: Observability L2 30m

Drills

Asset Level Time
Alerting Rules Drills L2 30m
LogQL Drills L2 30m
Observability Drills L2 30m
PromQL Drills L2 30m

References

Asset Level Time
Observability Architecture L2 30m
Track: Observability L2 10h

Security

Topic Packs

Asset Level Time
Audit Logging L1 2h
Backup Restore L1 2h
Compliance & Audit Automation L2 5h
Disaster Recovery & Backup Engineering L2 5h
HashiCorp Vault L2 4h
Infrastructure Forensics L2 5h
LDAP & Identity Management L2 5h
Offensive Security Basics L1 2h
Open Policy Agent L2 4h
Opsec Mistakes L1 2h
Runtime Security with Falco L2 4h
SELinux & AppArmor L2 5h
SELinux & Linux Hardening L2 5h
Secrets Management L2 4h
Security Basics (Ops-Focused) L1 4h
Security Scanning L1 2h
Supply Chain Security L2 3h
TLS & Certificates Ops L1 6h

Labs

Asset Level Time
Lab: Trivy Scan Remediation (CLI) L1 30m

Runbooks

Asset Level Time
Runbook: CVE Response (Critical Vulnerability) L2 20m
Runbook: Certificate Renewal Failed L2 15m
Runbook: Credential Rotation (Exposed Secret) L2 15m
Runbook: Secret Rotation L2 15m
Runbook: Unauthorized Access Investigation L2 20m

Scenarios

Asset Level Time
Interview: CI Vuln Scan Failed L2 15m
Interview: Certificate Expired L2 15m
Interview: Secret Leaked to Git L2 15m
Interview: Vault Token Expired L2 15m

Assessments

Asset Level Time
Skillcheck: Secrets Management L2 30m
Skillcheck: Security (Expanded) L2 30m
Skillcheck: TLS & PKI L2 30m

Drills

Asset Level Time
Secrets Management Drills L2 30m
Security Drills L2 30m
TLS & PKI Drills L2 30m

Cloud

Topic Packs

Asset Level Time
AWS CloudWatch L2 4h
AWS EC2 L1 4h
AWS ECS L2 4h
AWS IAM L1 4h
AWS Lambda L2 3h
AWS Networking L1 4h
AWS Route 53 L2 3h
AWS S3 Deep Dive L1 4h
AWS Troubleshooting L1 2h
Azure Troubleshooting L1 2h
Cloud Deep Dive L2 5h
GCP Troubleshooting L1 2h
Serverless Computing L2 1h

Runbooks

Asset Level Time
Runbook: Cloud Capacity Limit Hit L2 15m
Runbook: VPC IP Exhaustion L2 15m

Assessments

Asset Level Time
Skillcheck: Cloud Providers L1 30m

Drills

Asset Level Time
Cloud Deep Dive Drills L2 30m
Cloud Ops Drills L1 30m

Containers