GCP PCA · Question 27 · Domain 4: Analyzing and Optimizing Technical and Business Processes
Your organization has adopted Site Reliability Engineering (SRE) practices. The Service Level Objective (SLO) for your e-commerce checkout service is 99.9% availability over a 30-day rolling window. Currently, the service has experienced several outages, and the error budget has been completely exhausted. According to SRE best practices, what action should the team take?
Your organization has adopted Site Reliability Engineering (SRE) practices. The Service Level Objective (SLO) for your e-commerce checkout service is 99.9% availability over a 30-day rolling window. Currently, the service has experienced several outages, and the error budget has been completely exhausted. According to SRE best practices, what action should the team take?
Answer options:
Lower the SLO to 99.0% so the team is no longer in violation.
Halt all new feature deployments and focus engineering efforts exclusively on reliability and technical debt until the error budget recovers.
Fire the engineers responsible for the outages to enforce accountability.
Ignore the error budget and continue deploying features, as feature velocity is the most important metric for the business.
How to approach this question
Full Answer
Common mistakes
Practice the full GCP Professional Cloud Architect Practice Exam 3
50 questions · hints · full answers · grading
Expert