Hard1 markMultiple Choice
Domain 3.3: Database PerformanceDomain 3PerformanceRedshift

AWS SAA-C03 · Question 46 · Domain 3.3: Database Performance

A company has petabytes of historical data stored in Amazon S3. They want to query this data using standard SQL without loading it into a database. They also want to join this S3 data with frequently accessed data stored in their Amazon Redshift cluster. What is the MOST efficient solution?

Answer options:

A.

Use Amazon Athena to query S3, and export the results to Redshift.

B.

Load all the S3 data into Amazon Redshift.

C.

Use Amazon Redshift Spectrum to query the S3 data directly.

D.

Use AWS Glue to ETL the data into Amazon RDS.

How to approach this question

Look for the feature that extends Redshift to S3.

Full Answer

C.Use Amazon Redshift Spectrum to query the S3 data directly.✓ Correct
Use Amazon Redshift Spectrum to query the S3 data directly from the Redshift cluster.
Amazon Redshift Spectrum enables you to run Amazon Redshift SQL queries against exabytes of data in Amazon S3. It allows you to join data in S3 with data in your Redshift cluster without moving the data.

Common mistakes

Choosing Athena, which cannot natively join with Redshift local tables in a single query.

Practice the full AWS SAA-C03 Practice Exam 2

65 questions · hints · full answers · grading

More questions from this exam