Skip to content

Datacenter Operations Case Studies

20 incident case studies for hands-on troubleshooting practice.

Case Study Description
Bios Settings Reset After Cmos BIOS Settings Reverted After CMOS Battery Replacement
Bmc Clock Skew Cert Failure BMC Clock Skew - Certificate Failure
Bonding Failover Not Working Network Bonding Failover Not Working
Cable Management Wrong Port Server Cabled to Wrong Switch Port / Wrong VLAN
Disk Full Root Services Down Disk Full Root - Services Down
Firmware Update Boot Loop Firmware Update Boot Loop
Hba Firmware Mismatch HBA Firmware Mismatch Causing I/O Errors
Idrac Unreachable Os Up iDRAC Unreachable, OS Up
Link Flaps Bad Optic Link Flaps - Bad Optic
Memory Ecc Errors Increasing Memory ECC Errors Increasing
Nvme Drive Disappeared NVMe Drive Disappeared After Reboot
Os Install Fails Raid Controller OS Installation Cannot See Disks
Power Supply Redundancy Lost Power Supply Redundancy Lost
Pxe Boot Fails Uefi Mismatch PXE Boot Fails - UEFI Mismatch
Rack Pdu Overload Alert PDU Reporting Overload Warning
Raid Degraded Rebuild Latency RAID Degraded Rebuild Latency
Serial Console Garbled Serial-over-LAN Output Garbled
Server Intermittent Reboot Server Randomly Rebooting Every Few Hours
Server Remote Console Lag Remote KVM/Console Extremely Laggy
Thermal Throttle Fan Failure Thermal Throttle - Fan Failure