Questions and Answers

Question ieiNW4gZCJHilOCQ33SX

Question

A data engineer is configuring an AWS Glue job to read data from an Amazon S3 bucket. The data engineer has set up the necessary AWS Glue connection details and an associated IAM role. However, when the data engineer attempts to run the AWS Glue job, the data engineer receives an error message that indicates that there are problems with the Amazon S3 VPC gateway endpoint. The data engineer must resolve the error and connect the AWS Glue job to the S3 bucket. Which solution will meet this requirement?

Choices

  • A: Update the AWS Glue security group to allow inbound traffic from the Amazon S3 VPC gateway endpoint.
  • B: Configure an S3 bucket policy to explicitly grant the AWS Glue job permissions to access the S3 bucket.
  • C: Review the AWS Glue job code to ensure that the AWS Glue connection details include a fully qualified domain name.
  • D: Verify that the VPC’s route table includes inbound and outbound routes for the Amazon S3 VPC gateway endpoint.

Question x2srhrlPFFMG1EyGFMrw

Question

A data engineer needs to create an AWS Lambda function that converts the format of data from .csv to Apache Parquet. The Lambda function must run only if a user uploads a .csv file to an Amazon S3 bucket. Which solution will meet these requirements with the LEAST operational overhead?

Choices

  • A: Create an S3 event notification that has an event type of s3:ObjectCreated:*. Use a filter rule to generate notifications only when the suffix includes .csv. Set the Amazon Resource Name (ARN) of the Lambda function as the destination for the event notification.
  • B: Create an S3 event notification that has an event type of s3:ObjectTagging:* for objects that have a tag set to .csv. Set the Amazon Resource Name (ARN) of the Lambda function as the destination for the event notification.
  • C: Create an S3 event notification that has an event type of s3:*. Use a filter rule to generate notifications only when the suffix includes .csv. Set the Amazon Resource Name (ARN) of the Lambda function as the destination for the event notification.
  • D: Create an S3 event notification that has an event type of s3:ObjectCreated:*. Use a filter rule to generate notifications only when the suffix includes .csv. Set an Amazon Simple Notification Service (Amazon SNS) topic as the destination for the event notification. Subscribe the Lambda function to the SNS topic.

Question GCnQ3QXxxgqP7T4hz1B3

Question

An insurance company stores transaction data that the company compressed with gzip.

The company needs to query the transaction data for occasional audits.

Which solution will meet this requirement in the MOST cost-effective way?

Choices

  • A: Store the data in Amazon Glacier Flexible Retrieval. Use Amazon S3 Glacier Select to query the data.
  • B: Store the data in Amazon S3. Use Amazon S3 Select to query the data.
  • C: Store the data in Amazon S3. Use Amazon Athena to query the data.
  • D: Store the data in Amazon Glacier Instant Retrieval. Use Amazon Athena to query the data.

Question Qra1B2c2PlT9gWAfdXvt

Question

A data engineer finished testing an Amazon Redshift stored procedure that processes and inserts data into a table that is not mission critical. The engineer wants to automatically run the stored procedure on a daily basis.

Which solution will meet this requirement in the MOST cost-effective way?

Choices

  • A: Create an AWS Lambda function to schedule a cron job to run the stored procedure.
  • B: Schedule and run the stored procedure by using the Amazon Redshift Data API in an Amazon EC2 Spot Instance.
  • C: Use query editor v2 to run the stored procedure on a schedule.
  • D: Schedule an AWS Glue Python shell job to run the stored procedure.

Question dCIhCgzu42WYnfI1aSK5

Question

A marketing company collects clickstream data. The company sends the clickstream data to Amazon Kinesis Data Firehose and stores the clickstream data in Amazon S3. The company wants to build a series of dashboards that hundreds of users from multiple departments will use.

The company will use Amazon QuickSight to develop the dashboards. The company wants a solution that can scale and provide daily updates about clickstream activity.

Which combination of steps will meet these requirements MOST cost-effectively? (Choose two.)

Choices

  • A: Use Amazon Redshift to store and query the clickstream data.
  • B: Use Amazon Athena to query the clickstream data
  • C: Use Amazon S3 analytics to query the clickstream data.
  • D: Access the query data through a QuickSight direct SQL query.
  • E: Access the query data through QuickSight SPICE (Super-fast, Parallel, In-memory Calculation Engine). Configure a daily refresh for the dataset.