A company has petabytes of historical data stored in Amazon S3. They want to query this data using standard SQL without loading it into a database. They also want to join this S3 data with frequently accessed data stored in their Amazon Redshift cluster. What is the MOST efficient solution?

Answer options:

A. Use Amazon Athena to query S3, and export the results to Redshift.

B. Load all the S3 data into Amazon Redshift.

C. Use Amazon Redshift Spectrum to query the S3 data directly.

D. Use AWS Glue to ETL the data into Amazon RDS.

How to approach this question

Look for the feature that extends Redshift to S3.

Full Answer

C. Use Amazon Redshift Spectrum to query the S3 data directly. (Correct)

Amazon Redshift Spectrum enables you to run Amazon Redshift SQL queries against exabytes of data in Amazon S3. It allows you to join data in S3 with data in your Redshift cluster without moving the data.
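To make the mechanism concrete, here is a minimal sketch of the two SQL statements a Spectrum setup typically involves: one that maps a data catalog database to an external schema in Redshift, and one that joins an external (S3-backed) table with a local cluster table. The Python below only builds the SQL text and does not connect to AWS. Every name in it (the schema `spectrum_hist`, the catalog database `history_db`, the tables `orders_archive` and `public.customers`, the column `customer_id`, and the IAM role ARN) is a hypothetical placeholder, not something given in the question.

```python
# Sketch only: builds Redshift Spectrum SQL as strings, no AWS connection.
# All schema, table, column, and role names are hypothetical placeholders.


def external_schema_sql(schema: str, catalog_db: str, iam_role_arn: str) -> str:
    """Return DDL that exposes a data catalog database as an external schema."""
    return (
        f"CREATE EXTERNAL SCHEMA IF NOT EXISTS {schema}\n"
        "FROM DATA CATALOG\n"
        f"DATABASE '{catalog_db}'\n"
        f"IAM_ROLE '{iam_role_arn}';"
    )


def join_sql(schema: str, external_table: str, local_table: str, key: str) -> str:
    """Return a query joining an S3-backed external table with a local table."""
    return (
        f"SELECT l.{key}, COUNT(*) AS historical_rows\n"
        f"FROM {schema}.{external_table} AS h\n"
        f"JOIN {local_table} AS l ON h.{key} = l.{key}\n"
        f"GROUP BY l.{key};"
    )


if __name__ == "__main__":
    # Assumed example values, chosen purely for illustration.
    print(external_schema_sql(
        "spectrum_hist",
        "history_db",
        "arn:aws:iam::123456789012:role/ExampleSpectrumRole",
    ))
    print()
    print(join_sql("spectrum_hist", "orders_archive", "public.customers", "customer_id"))
```

In the generated join, the historical rows stay in S3 and are read through the external schema, while `public.customers` is an ordinary table in the cluster. That single-query join is what sets option C apart from the others.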

Common mistakes

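Choosing Athena. Athena can query the S3 data with standard SQL, but it cannot natively join with Redshift local tables in a single query, and exporting the results to Redshift adds a data movement step that Spectrum avoids.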


Practice the full AWS SAA-C03 Practice Exam 2
