From the course: DevOps Foundations: Site Reliability Engineering
Troubleshooting
- [Man In Gray] Troubleshooting, everyone does it. It's an inherent part of fixing service issues.
- But how do you do it? While it's a ubiquitous practice, most people still fly by the seat of their pants. They try what worked last time. They poke here and there to see what happens. And when all else fails, reboot some servers.
- You know, some of us have built an entire career out of restarting all the things. But we can do better by developing a troubleshooting methodology.
- Troubleshooting is essentially a form of inquiry. What's happening within this complex system?
- Luckily, there's a hot new technique to use in this case.
- [Man With Glasses] It was developed by an internet startup techno hipster named Aristotle and it's called the scientific method.
- [Man In Gray] Now you can dress it up all you want but the heart of successful troubleshooting is the same process you learned in science class.
- [Man With…
Contents
- Release engineering (5m 12s)
- Change management (2m 55s)
- Self-service automation (4m 46s)
- SLAs and SLOs (5m 21s)
- Incident management (5m 43s)
- Introducing postmortems (3m 29s)
- The postmortem process (4m 3s)
- Troubleshooting (5m 58s)
- Performance engineering (5m 36s)
- Capacity and scalability (5m 21s)
- Distributed design (5m 2s)
- Deliberate adversity (3m 57s)