Questions and Answers

Question zDtHu7du1HTzxqYyMDeg

Question

A retail company uses an Amazon Redshift data warehouse and an Amazon S3 bucket. The company ingests retail order data into the S3 bucket every day.

The company stores all order data at a single path within the S3 bucket. The data has more than 100 columns. The company ingests the order data from a third-party application that generates more than 30 files in CSV format every day. Each CSV file is between 50 and 70 MB in size.

The company uses Amazon Redshift Spectrum to run queries that select sets of columns. Users aggregate metrics based on daily orders. Recently, users have reported that the performance of the queries has degraded. A data engineer must resolve the performance issues for the queries.

Which combination of steps will meet this requirement with LEAST developmental effort? (Choose two.)

Choices

  • A: Configure the third-party application to create the files in a columnar format.
  • B: Develop an AWS Glue ETL job to convert the multiple daily CSV files to one file for each day.
  • C: Partition the order data in the S3 bucket based on order date.
  • D: Configure the third-party application to create the files in JSON format.
  • E: Load the JSON data into the Amazon Redshift table in a SUPER type column.

Question H6apOuWdnjeGjcKzklBG

Question

A company stores customer records in Amazon S3. The company must not delete or modify the customer record data for 7 years after each record is created. The root user also must not have the ability to delete or modify the data.

A data engineer wants to use S3 Object Lock to secure the data.

Which solution will meet these requirements?

Choices

  • A: Enable governance mode on the S3 bucket. Use a default retention period of 7 years.
  • B: Enable compliance mode on the S3 bucket. Use a default retention period of 7 years.
  • C: Place a legal hold on individual objects in the S3 bucket. Set the retention period to 7 years.
  • D: Set the retention period for individual objects in the S3 bucket to 7 years.

Question 0I5mbuFfIuSm1i2vG5Zh

Question

A data engineer needs to create a new empty table in Amazon Athena that has the same schema as an existing table named old_table.

Which SQL statement should the data engineer use to meet this requirement?

Choices

  • A: CREATE TABLE new_table AS SELECT * FROM old_tables;
  • B: INSERT INTO new_table SELECT * FROM old_table;
  • C: CREATE TABLE new_table (LIKE old_table);
  • D: CREATE TABLE new_table AS (SELECT * FROM old_table) WITH NO DATA;

Question cRVgnAk0leTLEkJ44Ri6

Question

A data engineer needs to create an Amazon Athena table based on a subset of data from an existing Athena table named cities_world. The cities_world table contains cities that are located around the world. The data engineer must create a new table named cities_us to contain only the cities from cities_world that are located in the US.

Which SQL statement should the data engineer use to meet this requirement?

Choices

  • A: INSERT INTO cities_usa (city,state) SELECT city, state FROM cities_world WHERE country=’usa’;
  • B: MOVE city, state FROM cities_world TO cities_usa WHERE country=’usa’;
  • C: INSERT INTO cities_usa SELECT city, state FROM cities_world WHERE country=’usa’;
  • D: UPDATE cities_usa SET (city, state) = (SELECT city, state FROM cities_world WHERE country=’usa’);

Question PvaK00CG9s2z5cr8ApIg

Question

A company implements a data mesh that has a central governance account. The company needs to catalog all data in the governance account. The governance account uses AWS Lake Formation to centrally share data and grant access permissions.

The company has created a new data product that includes a group of Amazon Redshift Serverless tables. A data engineer needs to share the data product with a marketing team. The marketing team must have access to only a subset of columns. The data engineer needs to share the same data product with a compliance team. The compliance team must have access to a different subset of columns than the marketing team needs access to.

Which combination of steps should the data engineer take to meet these requirements? (Choose two.)

Choices

  • A: Create views of the tables that need to be shared. Include only the required columns.
  • B: Create an Amazon Redshift data share that includes the tables that need to be shared.
  • C: Create an Amazon Redshift managed VPC endpoint in the marketing team’s account. Grant the marketing team access to the views.
  • D: Share the Amazon Redshift data share to the Lake Formation catalog in the governance account.
  • E: Share the Amazon Redshift data share to the Amazon Redshift Serverless workgroup in the marketing team’s account.