AWS SAP-C02 · Question 15 · Domain 1.5: Cost Optimization
A company has a massive data lake in Amazon S3. They use Amazon Athena for querying. Over time, query performance has degraded, and costs have skyrocketed. The data is currently stored in raw CSV format, partitioned by year. What is the MOST effective strategy to improve performance and reduce costs?
Answer options:
Migrate the data to Amazon Redshift and use Redshift Spectrum to query the CSV files.
Convert the CSV files to Apache Parquet format using AWS Glue. Update the partition strategy to year, month, and day.
Enable S3 Intelligent-Tiering for the data lake bucket to reduce storage costs.
Provision an Amazon EMR cluster to run queries instead of Athena.
75 questions · hints · full answers · grading