AWS SAP-C02 · Question 38 · Domain 2.4: Reliability
A data processing pipeline uses AWS Step Functions to orchestrate AWS Batch jobs. Occasionally, a Batch job fails due to a transient API error. The architect wants to ensure the workflow automatically retries the failed job before marking the execution as failed. What is the BEST approach?
Answer options:
Write a custom script in the Batch job to catch errors and restart itself.
Configure a Retry block within the Step Functions state machine definition for the specific Batch job state.
Use EventBridge to detect the Batch failure and trigger a Lambda to restart the Step Function.
Configure the AWS Batch job definition with a retry strategy.
75 questions · hints · full answers · grading