Questions and Answers
Question LrA4317n2GaaQlPO8flY
Question
A data engineer is working with two tables. Each of these tables is displayed below in its entirety.
//IMG//
The data engineer runs the following query to join these tables together:
//IMG//
Choices
- A:
- B:
- C:
- D:
answer?
Answer: C Answer_ET: C Community answer C (86%) 14% Discussion
Comment 1282707 by CommanderBigMac
- Upvotes: 3
Selected Answer: C Answer is C, we’re using a LEFT JOIN.
Comment 1233678 by vigaro
- Upvotes: 3
Selected Answer: C LEFT join. D is the result of a FULL join
Comment 1232723 by nescafe7
- Upvotes: 1
C - left outer join
Comment 1230482 by 31cadd7
- Upvotes: 1
Selected Answer: D IT’S D
Comment 1215998 by b79962e
- Upvotes: 1
Sorry wrong it is C for the left join
Comment 1215996 by b79962e
- Upvotes: 1
should be D
Question Q6OzF41sX2tVrIJSpcz9
Question
A data engineer needs to apply custom logic to identify employees with more than 5 years of experience in array column employees in table stores. The custom logic should create a new column exp_employees that is an array of all of the employees with more than 5 years of experience for each row. In order to apply this custom logic at scale, the data engineer wants to use the FILTER higher-order function.
Which code block successfully completes this task?
Choices
- A:
- B:
- C:
- D:
answer?
Answer: A Answer_ET: A Community answer A (100%) Discussion
Comment 1404789 by Billybob0604
- Upvotes: 1
Selected Answer: A There s a lot of double questions in this exam
Comment 1282708 by CommanderBigMac
- Upvotes: 2
Selected Answer: A Answer is A, if for no other reason than it is the only correct syntax of filter.
Question 48UiuygU0O5qgGLLKqXf
Question
A data engineer that is new to using Python needs to create a Python function to add two integers together and return the sum?
Which code block can the data engineer use to complete this task?
Choices
- A:
- B:
- C:
- D:
answer?
Answer: D Answer_ET: D Community answer D (100%) Discussion
Comment 1365985 by analyticstraining
- Upvotes: 1
Selected Answer: D It’s the same question as 66. Correct answer is D
Comment 1291134 by alinavit
- Upvotes: 1
Selected Answer: D correct is D
Comment 1265485 by 80370eb
- Upvotes: 1
Selected Answer: D def function must have “return” present.
Comment 1244580 by 3fbc31b
- Upvotes: 1
Selected Answer: D For the function to perform the actions “RETURN” must be present.
Comment 1227358 by hussamAlHunaiti
- Upvotes: 1
Selected Answer: D Answer is D.
Comment 1216530 by nawfalbourass
- Upvotes: 2
Selected Answer: D RETURN is needed
Comment 1215962 by PreranaC
- Upvotes: 1
Selected Answer: D D, RETURN in Python needed
Comment 1209947 by Kunka
- Upvotes: 1
Answer is D. C is wrong answer, as it missed return key workd
Comment 1209374 by Ivan_Petrov
- Upvotes: 2
Correct answer is D
Comment 1209097 by helmerpaiva
- Upvotes: 3
Correct is D
Comment 1208988 by jetplanes
- Upvotes: 1
Selected Answer: D The correct answer is D because the python function needs to provide a return for the function, so C is incorrect.
Question KQAMqetIT9xSBu3hY693
Question
A data engineer has configured a Structured Streaming job to read from a table, manipulate the data, and then perform a streaming write into a new table.
The code block used by the data engineer is below:
//IMG//
Which line of code should the data engineer use to fill in the blank if the data engineer only wants the query to execute a micro-batch to process data every 5 seconds?
Choices
- A: trigger(“5 seconds”)
- B: trigger(continuous=“5 seconds”)
- C: trigger(once=“5 seconds”)
- D: trigger(processingTime=“5 seconds”)
answer?
Answer: D Answer_ET: D Community answer D (100%) Discussion
Comment 1282709 by CommanderBigMac
- Upvotes: 2
Selected Answer: D D is the correct answer
Comment 1273047 by 9d4d68a
- Upvotes: 2
Repeated, Correct
Question bJRjR2PN6qthdcuI4ral
Question
A data engineer is maintaining a data pipeline. Upon data ingestion, the data engineer notices that the source data is starting to have a lower level of quality. The data engineer would like to automate the process of monitoring the quality level.
Which of the following tools can the data engineer use to solve this problem?
Choices
- A: Auto Loader
- B: Unity Catalog
- C: Delta Lake
- D: Delta Live Tables
answer?
Answer: D Answer_ET: D Community answer D (100%) Discussion
Comment 1316027 by 806e7d2
- Upvotes: 1
Selected Answer: D Delta Live Tables (DLT) is specifically designed for building and maintaining reliable, automated data pipelines. It includes built-in data quality enforcement through expectations and constraints, which allows data engineers to:
Define Data Quality Expectations: For example, you can define constraints such as CONSTRAINT valid_values EXPECT (column_name IS NOT NULL) to automatically enforce data quality rules. Monitor Violations: DLT tracks data quality metrics and logs violations, enabling monitoring of data quality trends over time. Automated Notifications and Actions: DLT can handle violations (e.g., drop rows, fail the pipeline, or log them for review) in an automated way.