Questions and Answers

Question qBXUvi4uADyEChXMpeSJ

Question

A data engineer is using an AWS Glue crawler to catalog data that is in an Amazon S3 bucket. The S3 bucket contains both .csv and json files. The data engineer configured the crawler to exclude the .json files from the catalog.

When the data engineer runs queries in Amazon Athena, the queries also process the excluded .json files. The data engineer wants to resolve this issue. The data engineer needs a solution that will not affect access requirements for the .csv files in the source S3 bucket.

Which solution will meet this requirement with the SHORTEST query times?

Choices

  • A: Adjust the AWS Glue crawler settings to ensure that the AWS Glue crawler also excludes .json files.
  • B: Use the Athena console to ensure the Athena queries also exclude the .json files.
  • C: Relocate the .json files to a different path within the S3 bucket.
  • D: Use S3 bucket policies to block access to the .json files.

Question CEdPbL5xORbecPYTGQVc

Question

A data engineer set up an AWS Lambda function to read an object that is stored in an Amazon S3 bucket. The object is encrypted by an AWS KMS key.

The data engineer configured the Lambda function’s execution role to access the S3 bucket. However, the Lambda function encountered an error and failed to retrieve the content of the object.

What is the likely cause of the error?

Choices

  • A: The data engineer misconfigured the permissions of the S3 bucket. The Lambda function could not access the object.
  • B: The Lambda function is using an outdated SDK version, which caused the read failure.
  • C: The S3 bucket is located in a different AWS Region than the Region where the data engineer works. Latency issues caused the Lambda function to encounter an error.
  • D: The Lambda function’s execution role does not have the necessary permissions to access the KMS key that can decrypt the S3 object.

Question WqruLrifFwoPia9DAT15

Question

A data engineer has implemented data quality rules in 1,000 AWS Glue Data Catalog tables. Because of a recent change in business requirements, the data engineer must edit the data quality rules.

How should the data engineer meet this requirement with the LEAST operational overhead?

Choices

  • A: Create a pipeline in AWS Glue ETL to edit the rules for each of the 1,000 Data Catalog tables. Use an AWS Lambda function to call the corresponding AWS Glue job for each Data Catalog table.
  • B: Create an AWS Lambda function that makes an API call to AWS Glue Data Quality to make the edits.
  • C: Create an Amazon EMR cluster. Run a pipeline on Amazon EMR that edits the rules for each Data Catalog table. Use an AWS Lambda function to run the EMR pipeline.
  • D: Use the AWS Management Console to edit the rules within the Data Catalog.

Question 6aLpnSlk1kMn3iIUNBsP

Question

Two developers are working on separate application releases. The developers have created feature branches named Branch A and Branch B by using a GitHub repository’s master branch as the source.

The developer for Branch A deployed code to the production system. The code for Branch B will merge into a master branch in the following week’s scheduled application release.

Which command should the developer for Branch B run before the developer raises a pull request to the master branch?

Choices

  • A: git diff branchB master git commit -m
  • B: git pull master
  • C: git rebase master
  • D: git fetch -b master

Question JjReFmb3fkL3YgBB8fPk

Question

A company stores employee data in Amazon Resdshift. A table names Employee uses columns named Region ID, Department ID, and Role ID as a compound sort key.

Which queries will MOST increase the speed of query by using a compound sort key of the table? (Choose two.)

Choices

  • A: Select *from Employee where Region ID=’North America’;
  • B: Select *from Employee where Region ID=’North America’ and Department ID=20;
  • C: Select *from Employee where Department ID=20 and Region ID=’North America’;
  • D: Select *from Employee where Role ID=50;
  • E: Select *from Employee where Region ID=’North America’ and Role ID=50;