Questions and Answers
Question bJ6E8kIO7b33d7CbFEsf
Question
The company stores a large volume of customer records in Amazon S3. To comply with regulations, the company must be able to access new customer records immediately for the first 30 days after the records are created. The company accesses records that are older than 30 days infrequently.
The company needs to cost-optimize its Amazon S3 storage.
Which solution will meet these requirements MOST cost-effectively?
Choices
- A: Apply a lifecycle policy to transition records to S3 Standard Infrequent-Access (S3 Standard-IA) storage after 30 days.
- B: Use S3 Intelligent-Tiering storage.
- C: Transition records to S3 Glacier Deep Archive storage after 30 days.
- D: Use S3 Standard-Infrequent Access (S3 Standard-IA) storage for all customer records.
answer?
Answer: A Answer_ET: A Community answer A (100%) Discussion
Comment 1330782 by HagarTheHorrible
- Upvotes: 2
Selected Answer: A this is badly defined question, it is not saying what is going on with data in firs 30 days, but cost efficiency indicates it is not B thus I would chose A
Comment 1317810 by emupsx1
- Upvotes: 1
Selected Answer: A https://docs.aws.amazon.com/AmazonS3/latest/userguide/object-lifecycle-mgmt.html
Question TcRlNES5HPHOzT6ZKdOt
Question
A data engineer is using Amazon QuickSight to build a dashboard to report a company’s revenue in multiple AWS Regions. The data engineer wants the dashboard to display the total revenue for a Region, regardless of the drill-down levels shown in the visual.
Which solution will meet these requirements?
Choices
- A: Create a table calculation.
- B: Create a simple calculated field.
- C: Create a level-aware calculation - aggregate (LAC-A) function.
- D: Create a level-aware calculation - window (LAC-W) function.
answer?
Answer: C Answer_ET: C Community answer C (100%) Discussion
Comment 1317811 by emupsx1
- Upvotes: 2
Selected Answer: C https://docs.aws.amazon.com/quicksight/latest/user/level-aware-calculations.html
Question 7KlSZPZC5mmoUD2akw3W
Question
A retail company stores customer data in an Amazon S3 bucket. Some of the customer data contains personally identifiable information (PII) about customers. The company must not share PII data with business partners.
A data engineer must determine whether a dataset contains PII before making objects in the dataset available to business partners.
Which solution will meet this requirement with the LEAST manual intervention?
Choices
- A: Configure the S3 bucket and S3 objects to allow access to Amazon Macie. Use automated sensitive data discovery in Macie.
- B: Configure AWS CloudTrail to monitor S3 PUT operations. Inspect the CloudTrail trails to identify operations that save PII.
- C: Create an AWS Lambda function to identify PII in S3 objects. Schedule the function to run periodically.
- D: Create a table in AWS Glue Data Catalog. Write custom SQL queries to identify PII in the table. Use Amazon Athena to run the queries.
answer?
Answer: A Answer_ET: A Community answer A (100%) Discussion
Comment 1341204 by MerryLew
- Upvotes: 1
Selected Answer: A Macie is a great option for PII discovery.
Comment 1313972 by michele_scar
- Upvotes: 2
Selected Answer: A PII → Macie
Comment 1307116 by kupo777
- Upvotes: 2
Correct Answer: A
Amazon Macie is designed for automated sensitive data discovery, classification, and protection. It can scan your S3 buckets to identify and classify PII using machine learning and pattern matching, which means you don’t need to manually inspect data or write custom functions. By configuring Macie to access the S3 bucket, it will continuously monitor and automatically alert you to any PII detected, significantly reducing the need for manual intervention.
Question xDMzs7cotEhbPjnbo485
Question
A data engineer needs to create an empty copy of an existing table in Amazon Athena to perform data processing tasks. The existing table in Athena contains 1,000 rows.
Which query will meet this requirement?
Choices
- A: CREATE TABLE new_table - LIKE old_table;
- B: CREATE TABLE new_table - AS SELECT *
FROM old_table - WITH NO DATA;
- C: CREATE TABLE new_table - AS SELECT * FROM old_table;
- D: CREATE TABLE new_table - as SELECT *
FROM old_cable - WHERE 1=1;
answer?
Answer: B Answer_ET: B Community answer B (100%) Discussion
Comment 1305403 by pikuantne
- Upvotes: 2
Selected Answer: B Definitely B
Comment 1305356 by truongnguyen86
- Upvotes: 3
Selected Answer: B should be B with no data option to create empty table from CTAS
Comment 1303787 by 2022MMTT
- Upvotes: 2
B https://docs.aws.amazon.com/athena/latest/ug/ctas-examples.html#ctas-example-empty-table
Comment 1303487 by Parandhaman_Margan
- Upvotes: 1
Answer: B
Question xGpZWnbhxMlWVmKrvMt7
Question
A company has a data lake in Amazon S3. The company collects AWS CloudTrail logs for multiple applications. The company stores the logs in the data lake, catalogs the logs in AWS Glue, and partitions the logs based on the year. The company uses Amazon Athena to analyze the logs.
Recently, customers reported that a query on one of the Athena tables did not return any data. A data engineer must resolve the issue.
Which combination of troubleshooting steps should the data engineer take? (Choose two.)
Choices
- A: Confirm that Athena is pointing to the correct Amazon S3 location.
- B: Increase the query timeout duration.
- C: Use the MSCK REPAIR TABLE command.
- D: Restart Athena.
- E: Delete and recreate the problematic Athena table.
answer?
Answer: AC Answer_ET: AC Community answer AC (100%) Discussion
Comment 1313974 by michele_scar
- Upvotes: 2
Selected Answer: AC B and D definitely wrong E should be dangerous
Comment 1307114 by kupo777
- Upvotes: 3
Correct Answer: AC
A. Confirm that Athena is pointing to the correct Amazon S3 location.
This is a critical first step to ensure that the data source Athena is querying matches the actual location of the CloudTrail logs in S3. If the path is incorrect, Athena will not find the data. C. Use the MSCK REPAIR TABLE command.
If the data lake is partitioned, using the MSCK REPAIR TABLE command can help update the table metadata in Athena. This command will add any missing partitions to the table, which may resolve issues related to missing data if new partitions were added but not reflected in Athena.
Comment 1305448 by truongnguyen86
- Upvotes: 1
should be A, B. D and E are incorrect, C: MSCK REPAIR TABLE is a Hive command in Amazon Athena not for S3