You are advising a traditional IT operations team that is transitioning to a Site Reliability Engineering (SRE) model. They are currently overwhelmed by manual alerts and spend most of their time firefighting. Which TWO SRE practices should you recommend they implement first to reduce toil and improve reliability? (Select TWO)

Answer options:

A. Implement blameless postmortems for every significant incident.

B. Fire the operations team and force the developers to carry the pager.

C. Create an alert for every single CPU spike over 80%.

D. Define Service Level Objectives (SLOs) and only page engineers when the error budget is threatened.

E. Implement a strict ITIL change management process with a 2-week approval window.

How to approach this question

Look for core SRE principles: learning from failure without blame, and alerting based on user experience rather than system metrics.

Full Answer

To transition to SRE and reduce toil, the team must change how it handles incidents and alerts. The two correct choices are blameless postmortems and SLO-based paging. Blameless postmortems focus on fixing the system rather than punishing individuals, and they produce action items that prevent recurrence. Defining SLOs and alerting on error budget burn rate means engineers are paged only when the user experience is genuinely at risk, which removes most of the noise from CPU or memory alerts that have no user impact.
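
As a rough illustration of paging on error budget burn rate, the Python sketch below computes how fast a service is consuming its budget and pages only when both a long and a short window exceed a threshold. The names `Window`, `burn_rate`, and `should_page` are hypothetical, and the 14.4 threshold is an assumed example value (a burn rate that would consume about 2% of a 30-day budget in one hour), not something the question specifies.

```python
from dataclasses import dataclass


@dataclass
class Window:
    """Request counts observed over one lookback window (hypothetical type)."""
    total_requests: int
    failed_requests: int


def burn_rate(window: Window, slo_target: float) -> float:
    """Observed error rate divided by the error rate the SLO allows.

    A value of 1.0 means the budget is being spent exactly on schedule;
    higher values mean it will run out before the SLO period ends.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if window.total_requests == 0:
        return 0.0  # no traffic, so no budget is being consumed
    error_budget = 1.0 - slo_target  # e.g. 0.001 for a 99.9% SLO
    observed_error_rate = window.failed_requests / window.total_requests
    return observed_error_rate / error_budget


def should_page(long_window: Window, short_window: Window,
                slo_target: float, threshold: float = 14.4) -> bool:
    """Page only if both windows burn faster than the threshold.

    The short window confirms the problem is still happening, so a
    burst that has already recovered does not wake anyone up.
    """
    return (burn_rate(long_window, slo_target) >= threshold
            and burn_rate(short_window, slo_target) >= threshold)


if __name__ == "__main__":
    # Assumed example: 99.9% availability SLO, 2% errors over the last
    # hour and 3% over the last five minutes.
    last_hour = Window(total_requests=100_000, failed_requests=2_000)
    last_5_min = Window(total_requests=10_000, failed_requests=300)
    print(should_page(last_hour, last_5_min, slo_target=0.999))
```

Note that CPU utilisation does not appear anywhere in this decision: a host at 95% CPU that still serves requests within the SLO never triggers a page, while a quiet host returning errors does.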

Common mistakes

Choosing Option C (an alert for every CPU spike over 80%). Alerting on CPU is a classic traditional IT mistake that leads to alert fatigue. High CPU is fine as long as the application is still serving requests quickly and within its SLO.


