15 questions across 6 exams
An architect is reviewing a legacy application running on EC2 instances. The application frequently experiences CPU exhaustion, leading to downtime. The architect wants to implement a self-healing architecture. What is the BEST approach?
A company wants to proactively test the resilience of their AWS infrastructure by intentionally injecting failures, such as terminating EC2 instances or simulating network latency. Which AWS service should they use to orchestrate these chaos engineering experiments safely?
A company is designing a highly available architecture for a stateful legacy application that cannot be containerized. The application runs on EC2 instances and requires a persistent IP address. If an instance fails, a new one must be launched automatically and retain the same IP address. Which TWO actions should the architect take? (Select TWO)
An architect is reviewing an existing architecture consisting of EC2 instances behind an Application Load Balancer. The instances process messages from an SQS queue. During traffic spikes, the queue backs up and messages are delayed. How can the architect improve the reliability and elasticity of this system?
A company wants to implement a chaos engineering practice to test the resilience of their AWS infrastructure. They want to simulate EC2 instance terminations, RDS failovers, and network latency. Which AWS service should they use?
A company is running a critical database on an EC2 instance. They want to ensure that if the underlying hardware fails, the instance is automatically recovered on new hardware with the same IP address, EBS volume attachments, and instance ID. What is the simplest way to achieve this?
A company wants to test the resilience of their microservices architecture by intentionally injecting faults, such as terminating EC2 instances and simulating network latency. Which AWS service should they use?
A company wants to improve the reliability of their application by implementing a multi-region active-passive architecture. Which THREE AWS services/features are essential for automating the failover process? (Select THREE)
A microservices architecture uses Amazon ECS and Amazon RDS. Occasionally, the RDS database experiences brief failovers or network blips, causing the ECS services to crash and restart, leading to a poor user experience. The architect wants to improve the reliability of the application so it can handle these transient database errors gracefully. Which strategies should be implemented? (Select TWO)
A company wants to implement Chaos Engineering to test the resilience of their AWS infrastructure. They want to simulate an Availability Zone power outage to ensure their Auto Scaling groups and RDS Multi-AZ failovers work as expected. They must ensure that these experiments do not accidentally cause a complete system outage. Which combination of actions should they take? (Select TWO)
A company wants to implement a chaos engineering practice to test the resilience of their AWS infrastructure. They want to simulate EC2 instance terminations, RDS database failovers, and network latency injections. The testing must be automated, safely scoped to specific tagging environments (e.g., Environment=Staging), and automatically stop if CloudWatch alarms indicate severe application degradation. Which combination of actions should the Architect take? (Select THREE)
A company is designing a highly available architecture for a stateful legacy application. The application runs on EC2 instances and stores session state on the local EBS volumes. The application cannot be rewritten to use an external session store (like ElastiCache). The architecture must ensure that if an EC2 instance fails, a new instance is launched and the exact same EBS volume (with the session state) is attached to it automatically. Which combination of services and features should be used? (Select TWO)
An application uses Amazon EC2 instances in an Auto Scaling group behind an ALB. During sudden, massive spikes in traffic, the Auto Scaling group takes 5 minutes to launch new instances, during which time the application becomes unresponsive. Which TWO strategies will improve reliability during these spikes? (Select TWO)
A company is redesigning a monolithic application to improve reliability. The application currently processes user uploads synchronously, often timing out. The architect wants to decouple the architecture. Which THREE services can be used together to build a loosely coupled, asynchronous processing pipeline? (Select THREE)
An enterprise database runs on a single large EC2 instance. The business requires improving reliability to ensure high availability across Availability Zones and automated failover, while minimizing administrative overhead. Which THREE steps should the architect take? (Select THREE)
Full answers, grading, and explanations on why each answer is correct.