Questions and Answers

Question bJ6E8kIO7b33d7CbFEsf

Question

A company stores a large volume of customer records in Amazon S3. To comply with regulations, the company must be able to access new customer records immediately for the first 30 days after the records are created. The company accesses records that are older than 30 days infrequently.

The company needs to cost-optimize its Amazon S3 storage.

Which solution will meet these requirements MOST cost-effectively?

Choices

A: Apply a lifecycle policy to transition records to S3 Standard-Infrequent Access (S3 Standard-IA) storage after 30 days.
B: Use S3 Intelligent-Tiering storage.
C: Transition records to S3 Glacier Deep Archive storage after 30 days.
D: Use S3 Standard-Infrequent Access (S3 Standard-IA) storage for all customer records.

Answer: A
Answer_ET: A
Community answer: A (100%)

Discussion

Comment 1330782 by HagarTheHorrible

Upvotes: 2

Selected Answer: A This is a badly defined question: it does not say what happens to the data in the first 30 days. But the cost-efficiency requirement indicates it is not B, so I would choose A.

Comment 1317810 by emupsx1

Upvotes: 1

Selected Answer: A https://docs.aws.amazon.com/AmazonS3/latest/userguide/object-lifecycle-mgmt.html
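The lifecycle rule from answer A can be sketched as a configuration document. This is a minimal sketch, not the company's actual setup: the rule ID, the empty prefix, and the helper names `build_ia_lifecycle` and `apply_lifecycle` are assumptions made for illustration. The dictionary follows the shape that boto3's `put_bucket_lifecycle_configuration` accepts.

```python
def build_ia_lifecycle(days: int = 30, prefix: str = "") -> dict:
    """Build a lifecycle configuration that moves objects to S3 Standard-IA."""
    if days < 30:
        # S3 does not transition objects from S3 Standard to Standard-IA
        # before they are 30 days old.
        raise ValueError("Standard-IA transitions require at least 30 days")
    return {
        "Rules": [
            {
                "ID": "customer-records-to-standard-ia",  # hypothetical rule name
                "Status": "Enabled",
                "Filter": {"Prefix": prefix},  # empty prefix matches every object
                "Transitions": [{"Days": days, "StorageClass": "STANDARD_IA"}],
            }
        ]
    }


def apply_lifecycle(bucket: str, config: dict) -> None:
    """Send the configuration to S3 (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so the builder runs without boto3 installed

    boto3.client("s3").put_bucket_lifecycle_configuration(
        Bucket=bucket, LifecycleConfiguration=config
    )


config = build_ia_lifecycle()
print(config["Rules"][0]["Transitions"])
```

Records stay in S3 Standard (immediately accessible) for the first 30 days and move to the cheaper Standard-IA class afterwards, which is what separates answer A from answer D.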

Question TcRlNES5HPHOzT6ZKdOt

Question

A data engineer is using Amazon QuickSight to build a dashboard to report a company’s revenue in multiple AWS Regions. The data engineer wants the dashboard to display the total revenue for a Region, regardless of the drill-down levels shown in the visual.

Which solution will meet these requirements?

Choices

A: Create a table calculation.
B: Create a simple calculated field.
C: Create a level-aware calculation - aggregate (LAC-A) function.
D: Create a level-aware calculation - window (LAC-W) function.

Answer: C
Answer_ET: C
Community answer: C (100%)

Discussion

Comment 1317811 by emupsx1

Upvotes: 2

Selected Answer: C https://docs.aws.amazon.com/quicksight/latest/user/level-aware-calculations.html
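To illustrate what a LAC-A function does, here is a plain-Python sketch (not QuickSight code) that fixes the aggregation level at the Region while the rows are displayed at a finer drill-down level. The sample rows and field names are made up. In QuickSight itself the calculated field would be roughly of the form `sum({Revenue}, [{Region}])`, assuming fields named Revenue and Region.

```python
from collections import defaultdict

# Hypothetical rows, as a visual drilled down to (region, city) might show them.
rows = [
    {"region": "us-east-1", "city": "Boston", "revenue": 120.0},
    {"region": "us-east-1", "city": "New York", "revenue": 80.0},
    {"region": "eu-west-1", "city": "Dublin", "revenue": 50.0},
]


def total_by_level(rows, level, measure):
    """Aggregate the measure at one fixed level, ignoring finer dimensions."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[level]] += row[measure]
    return dict(totals)


def with_region_total(rows):
    """Attach the Region-level total to every row, whatever the drill-down."""
    totals = total_by_level(rows, "region", "revenue")
    return [{**row, "region_revenue": totals[row["region"]]} for row in rows]


for row in with_region_total(rows):
    print(row["region"], row["city"], row["revenue"], row["region_revenue"])
```

Every city row carries the same Region total, which is the behavior the question asks for: the total does not change when the visual drills down.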

Question 7KlSZPZC5mmoUD2akw3W

Question

A retail company stores customer data in an Amazon S3 bucket. Some of the customer data contains personally identifiable information (PII) about customers. The company must not share PII data with business partners.

A data engineer must determine whether a dataset contains PII before making objects in the dataset available to business partners.

Which solution will meet this requirement with the LEAST manual intervention?

Choices

A: Configure the S3 bucket and S3 objects to allow access to Amazon Macie. Use automated sensitive data discovery in Macie.
B: Configure AWS CloudTrail to monitor S3 PUT operations. Inspect the CloudTrail trails to identify operations that save PII.
C: Create an AWS Lambda function to identify PII in S3 objects. Schedule the function to run periodically.
D: Create a table in AWS Glue Data Catalog. Write custom SQL queries to identify PII in the table. Use Amazon Athena to run the queries.

Answer: A
Answer_ET: A
Community answer: A (100%)

Discussion

Comment 1341204 by MerryLew

Upvotes: 1

Selected Answer: A Macie is a great option for PII discovery.

Comment 1313972 by michele_scar

Upvotes: 2

Selected Answer: A PII → Macie

Comment 1307116 by kupo777

Upvotes: 2

Correct Answer: A

Amazon Macie is designed for automated sensitive data discovery, classification, and protection. It can scan your S3 buckets to identify and classify PII using machine learning and pattern matching, which means you don’t need to manually inspect data or write custom functions. Once Macie is configured with access to the S3 bucket, it continuously monitors the bucket and automatically alerts you to any PII it detects, significantly reducing the need for manual intervention.

Question xDMzs7cotEhbPjnbo485

Question

A data engineer needs to create an empty copy of an existing table in Amazon Athena to perform data processing tasks. The existing table in Athena contains 1,000 rows.

Which query will meet this requirement?

Choices

A: CREATE TABLE new_table LIKE old_table;
B: CREATE TABLE new_table AS SELECT * FROM old_table WITH NO DATA;
C: CREATE TABLE new_table AS SELECT * FROM old_table;
D: CREATE TABLE new_table AS SELECT * FROM old_table WHERE 1=1;

Answer: B
Answer_ET: B
Community answer: B (100%)

Discussion

Comment 1305403 by pikuantne

Upvotes: 2

Selected Answer: B Definitely B

Comment 1305356 by truongnguyen86

Upvotes: 3

Selected Answer: B Should be B: the WITH NO DATA option creates an empty table from a CTAS query.

Comment 1303787 by 2022MMTT

Upvotes: 2

B https://docs.aws.amazon.com/athena/latest/ug/ctas-examples.html#ctas-example-empty-table

Comment 1303487 by Parandhaman_Margan

Upvotes: 1

Answer: B
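As a sketch of how the CTAS statement from answer B might be generated and submitted, the snippet below builds the query string and, optionally, hands it to Athena. The helper names `empty_copy_query` and `run_in_athena`, the identifier check, and the database and output-location parameters are assumptions for illustration. The boto3 call used is `start_query_execution`.

```python
import re

# Conservative identifier check so table names cannot smuggle in extra SQL.
_IDENTIFIER = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*$")


def empty_copy_query(new_table: str, old_table: str) -> str:
    """Return a CTAS statement that copies the schema but none of the rows."""
    for name in (new_table, old_table):
        if not _IDENTIFIER.match(name):
            raise ValueError(f"unsafe table name: {name!r}")
    return f"CREATE TABLE {new_table} AS SELECT * FROM {old_table} WITH NO DATA"


def run_in_athena(query: str, database: str, output_location: str) -> str:
    """Submit the query to Athena (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so the builder runs without boto3 installed

    response = boto3.client("athena").start_query_execution(
        QueryString=query,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    return response["QueryExecutionId"]


print(empty_copy_query("new_table", "old_table"))
```

Options C and D would copy all 1,000 rows, because `WHERE 1=1` is true for every row.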

Question xGpZWnbhxMlWVmKrvMt7

Question

A company has a data lake in Amazon S3. The company collects AWS CloudTrail logs for multiple applications. The company stores the logs in the data lake, catalogs the logs in AWS Glue, and partitions the logs based on the year. The company uses Amazon Athena to analyze the logs.

Recently, customers reported that a query on one of the Athena tables did not return any data. A data engineer must resolve the issue.

Which combination of troubleshooting steps should the data engineer take? (Choose two.)

Choices

A: Confirm that Athena is pointing to the correct Amazon S3 location.
B: Increase the query timeout duration.
C: Use the MSCK REPAIR TABLE command.
D: Restart Athena.
E: Delete and recreate the problematic Athena table.

Answer: AC
Answer_ET: AC
Community answer: AC (100%)

Discussion

Comment 1313974 by michele_scar

Upvotes: 2

Selected Answer: AC B and D are definitely wrong. E would be dangerous.

Comment 1307114 by kupo777

Upvotes: 3

Correct Answer: AC

A. Confirm that Athena is pointing to the correct Amazon S3 location.

This is a critical first step to ensure that the data source Athena is querying matches the actual location of the CloudTrail logs in S3. If the path is incorrect, Athena will not find the data.

C. Use the MSCK REPAIR TABLE command.

If the data lake is partitioned, using the MSCK REPAIR TABLE command can help update the table metadata in Athena. This command will add any missing partitions to the table, which may resolve issues related to missing data if new partitions were added but not reflected in Athena.

Comment 1305448 by truongnguyen86

Upvotes: 1

Should be A, B. D and E are incorrect. C: MSCK REPAIR TABLE is a Hive command in Amazon Athena, not for S3.
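One detail worth checking while troubleshooting: MSCK REPAIR TABLE only picks up partitions stored in Hive-style `key=value` prefixes, while other layouts need `ALTER TABLE ADD PARTITION`. The sketch below checks a list of object keys for a `year=YYYY/` prefix. The sample keys and helper names are hypothetical. In practice the keys would come from listing the table's S3 location.

```python
import re

# Matches a Hive-style year partition such as "logs/year=2024/...".
_YEAR_PARTITION = re.compile(r"(?:^|/)year=(\d{4})/")


def hive_year_partitions(keys):
    """Return the years that appear as Hive-style `year=YYYY/` prefixes."""
    years = set()
    for key in keys:
        match = _YEAR_PARTITION.search(key)
        if match:
            years.add(int(match.group(1)))
    return years


def non_hive_keys(keys):
    """Return keys that MSCK REPAIR TABLE would not map to a year partition."""
    return [key for key in keys if not _YEAR_PARTITION.search(key)]


# Hypothetical object keys under the table's S3 location.
keys = [
    "logs/year=2023/part-0001.json.gz",
    "logs/year=2024/part-0001.json.gz",
    "logs/2025/part-0001.json.gz",
]

print(sorted(hive_year_partitions(keys)))
print(non_hive_keys(keys))
```

If the years found in S3 differ from the partitions registered in the Glue Data Catalog, running `MSCK REPAIR TABLE` on the table is the usual next step.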
