What you will learn
- The core principles of Site Reliability Engineering.
- How to design automation strategies, perform operational readiness reviews, employ cost-optimization strategies, and manage backups and recoveries.
- How to identify key metrics and measure service health using cloud monitoring techniques.
- How to identify and manage incidents, develop action plans to mitigate future risk, and perform post-incident reviews.
- The key concepts for monitoring and managing security threats.
- How to troubleshoot common IBM Cloud issues.
- How to design and improve reliability for systems and cloud services and employ best practices to automate deployments.
Program Overview
- Expert instruction: 3 skill-building courses
- Self-paced: progress at your own speed
- 4 months: 2-3 hours per week
- $297 USD for the full program experience
Courses in this program
IBM's Site Reliability Engineering (SRE) Professional Certificate
- SRE Fundamentals and Security
- SRE Infrastructure, Resiliency and Deployment Automation
- SRE Capstone
Meet your instructor from IBM
Experts from IBM committed to teaching online