Hard1 markMultiple Choice
Subtask 4.3: Resilience ProceduresSREError BudgetsSLOReliability

GCP PCA · Question 30 · Resilience Procedures

Your organization has adopted Site Reliability Engineering (SRE) practices. The development team wants to push a major new feature to production, but the operations team notes that the application has consumed 110% of its error budget for the current 30-day window due to recent outages. According to SRE principles, what should happen next?

Answer options:

A.

Deploy the feature anyway, as feature velocity is the primary goal of DevOps.

B.

Increase the error budget by lowering the Service Level Objective (SLO) so the deployment can proceed.

C.

Halt all new feature deployments and redirect engineering effort toward improving system reliability until the error budget recovers.

D.

Deploy the feature to a small subset of users using a canary release to minimize risk.

How to approach this question

Understand the core SRE concept of Error Budgets and the consequences of exhausting them.

Full Answer

C.Halt all new feature deployments and redirect engineering effort toward improving system reliability until the error budget recovers.✓ Correct
Halt all new feature deployments and redirect engineering effort toward improving system reliability until the error budget recovers.
An error budget is the acceptable level of unreliability a system can have (e.g., if SLO is 99.9%, the error budget is 0.1%). It aligns dev and ops teams. When the error budget is exhausted, the agreed-upon policy in SRE is to freeze new feature releases and pivot all engineering efforts to reliability tasks (paying down technical debt) until the budget resets.

Common mistakes

Choosing Canary Release (Option D). Even safe deployments introduce change, and change introduces risk. When the budget is gone, risk must be minimized to zero.

Practice the full GCP Professional Cloud Architect Practice Exam 7

50 questions · hints · full answers · grading

More questions from this exam