GCP PCA · Question 30 · Resilience Procedures
Your organization has adopted Site Reliability Engineering (SRE) practices. The development team wants to push a major new feature to production, but the operations team notes that the application has consumed 110% of its error budget for the current 30-day window due to recent outages. According to SRE principles, what should happen next?
Answer options:
Deploy the feature anyway, as feature velocity is the primary goal of DevOps.
Increase the error budget by lowering the Service Level Objective (SLO) so the deployment can proceed.
Halt all new feature deployments and redirect engineering effort toward improving system reliability until the error budget recovers.
Deploy the feature to a small subset of users using a canary release to minimize risk.
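The error-budget arithmetic behind the scenario can be sketched as follows. The question does not state the SLO, so the 99.9% availability target here is an assumption chosen only for illustration, and the `ErrorBudget` class and its method names are hypothetical rather than part of any Google Cloud API.

```python
from dataclasses import dataclass

MINUTES_PER_DAY = 24 * 60


@dataclass(frozen=True)
class ErrorBudget:
    """Time-based error budget for an availability SLO (illustrative only)."""

    slo: float        # availability target, e.g. 0.999 (ASSUMED; not given in the question)
    window_days: int  # length of the compliance window in days

    @property
    def budget_minutes(self) -> float:
        # Allowed "bad" minutes in the window: (1 - SLO) * window length.
        return (1.0 - self.slo) * self.window_days * MINUTES_PER_DAY

    def consumed_fraction(self, bad_minutes: float) -> float:
        # Share of the budget already spent; values above 1.0 mean overspent.
        return bad_minutes / self.budget_minutes

    def is_exhausted(self, bad_minutes: float) -> bool:
        # The budget is exhausted once consumption reaches 100%.
        return self.consumed_fraction(bad_minutes) >= 1.0


# Scenario from the question: 110% of the budget consumed in a 30-day window.
budget = ErrorBudget(slo=0.999, window_days=30)
bad_minutes = 1.10 * budget.budget_minutes

print(f"Budget for the window: {budget.budget_minutes:.2f} min")
print(f"Downtime so far:       {bad_minutes:.2f} min")
print(f"Budget consumed:       {budget.consumed_fraction(bad_minutes):.0%}")
print(f"Budget exhausted:      {budget.is_exhausted(bad_minutes)}")
```

Under the assumed 99.9% target, a 30-day window allows roughly 43 minutes of unavailability, so 110% consumption corresponds to a few minutes more than the SLO permits. A different SLO changes the minutes but not the conclusion that the budget is overspent.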