
Unlock Reliable Services: SRE Foundation Course Highlights at GOPS 2020

The SRE Foundation course presented at the GOPS 2020 Global Operations Conference in Shenzhen introduces core Site Reliability Engineering principles, practical tools, and certification preparation through eight detailed modules, targeting a wide range of IT professionals and business stakeholders.

Efficient Ops

The SRE Foundation course, held before the GOPS 2020 Global Operations Conference in Shenzhen, provides an introduction to Site Reliability Engineering (SRE) principles and practices, enabling organizations to scale critical services reliably and economically while adopting new engineering and automation paradigms.

Course Audience

Anyone interested in higher reliability

Those curious about modern IT leadership and organizational change

SRE engineers

Business managers

Business stakeholders

Consultants

DevOps practitioners

IT supervisors

IT managers

IT team leads

Product owners

Scrum masters

Software engineers

System integrators

Tool providers

Course Outline

Module 1: SRE Principles and Practices

What is Site Reliability Engineering?

Differences between SRE and DevOps

SRE principles and conventions

Module 2: Service Level Objectives and Error Budgets

Service Level Objectives (SLO)

Error budgets

Error budget policies

Module 3: Reducing Toil

What is toil?

Why is it burdensome?

Module 4: Monitoring and Service Level Indicators

Service Level Indicators (SLI)

Monitoring

Observability

Module 5: SRE Tools and Automation

Definition of automation

Automation focus

Hierarchy of automation types

Security automation

Automation tools

Module 6: Antifragility and Learning from Failure

Why learn from failure

Benefits of antifragility

Shifting organizational balance

Module 7: Organizational Impact of SRE

Why organizations adopt SRE

Adoption patterns

On‑call practices

Post‑mortems and retrospectives

SRE at scale

Module 8: SRE and Other Frameworks

SRE compared with other frameworks

Future directions

Additional resources

Exam preparation

Exam requirements, weighting, and glossary

Sample exam review

Learning Objectives

Understand the history of SRE and its practice at Google

Explore the relationship between SRE, DevOps, and other popular frameworks

Grasp the fundamental principles behind SRE

Comprehend Service Level Objectives (SLO) and their user focus

Learn about Service Level Indicators (SLI) and modern monitoring environments

Understand error budgets and related policies

Recognize how observability indicates service health

Identify SRE tools, automation techniques, and the importance of security

Apply concepts of antifragility, failure testing, and learning from failures

Assess the organizational impact of introducing SRE
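
To make the SLO and error budget objectives above concrete, here is a minimal Python sketch of the usual arithmetic: an error budget is the complement of the SLO target, applied either to a time window or to a request count. This is an illustration only, not course material. The function names, the 30-day window, and the 99.9% target are assumptions chosen for the example.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Allowed downtime, in minutes, for an availability SLO over a window.

    slo_target is a fraction such as 0.999 (assumed example value).
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(slo_target: float, good_events: int, total_events: int) -> float:
    """Fraction of the error budget still unspent for a request-based SLI.

    Returns 1.0 when nothing has been spent, 0.0 when the budget is exactly
    used up, and a negative value when the SLO has been violated.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_events == 0:
        return 1.0  # no traffic yet, so no budget has been consumed
    allowed_bad = (1.0 - slo_target) * total_events
    actual_bad = total_events - good_events
    return 1.0 - actual_bad / allowed_bad
```

Under these assumptions, `error_budget_minutes(0.999)` comes to roughly 43.2 minutes of allowed downtime per 30 days, and `budget_remaining(0.999, 999_500, 1_000_000)` reports about half the budget left. An error budget policy would then state what the team does as that fraction approaches zero, for example pausing feature releases.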

Conference Details

The GOPS Global Operations Conference is co‑hosted by GreatOPS and OOPSA under the guidance of the Ministry of Industry and Information Technology’s Data Center Alliance, and is the premier operations industry event in China. The 14th edition took place on September 25‑26, 2020 in Shenzhen and focused on AIOps, operations automation, and DevOps. Across its history, the conference series has attracted over 60,000 participants.

Written by

Efficient Ops

This public account is maintained by Xiaotianguo and friends and regularly publishes widely read original technical articles. We focus on operations transformation and aim to accompany readers throughout their operations careers.
