Questions and Answers

Question MI5jKvqslZrq06vTK1eX

Question

A dataset has been defined using Delta Live Tables and includes an expectations clause:

CONSTRAINT valid_timestamp EXPECT (timestamp > ‘2020-01-01’) ON VIOLATION DROP ROW

What is the expected behavior when a batch of data containing data that violates these constraints is processed?

Choices

  • A: Records that violate the expectation cause the job to fail.
  • B: Records that violate the expectation are added to the target dataset and flagged as invalid in a field added to the target dataset.
  • C: Records that violate the expectation are dropped from the target dataset and recorded as invalid in the event log.
  • D: Records that violate the expectation are added to the target dataset and recorded as invalid in the event log.

Question hlo6no0YikmL1mMhHYQO

Question

A data engineer has a Job with multiple tasks that runs nightly. Each of the tasks runs slowly because the clusters take a long time to start.

Which action can the data engineer perform to improve the start up time for the clusters used for the Job?

Choices

  • A: They can use endpoints available in Databricks SQL
  • B: They can use jobs clusters instead of all-purpose clusters
  • C: They can configure the clusters to autoscale for larger data sizes
  • D: They can use clusters that are from a cluster pool

Question HmfSB6m4BqIbGg8ykrPX

Question

A data engineer has a single-task Job that runs each morning before they begin working. After identifying an upstream data issue, they need to set up another task to run a new notebook prior to the original task.

Which approach can the data engineer use to set up the new task?

Choices

  • A: They can clone the existing task in the existing Job and update it to run the new notebook.
  • B: They can create a new task in the existing Job and then add it as a dependency of the original task.
  • C: They can create a new task in the existing Job and then add the original task as a dependency of the new task.
  • D: They can create a new job from scratch and add both tasks to run concurrently.

Question d23UJBhkUBBtyZFmFl3D

Question

A single Job runs two notebooks as two separate tasks. A data engineer has noticed that one of the notebooks is running slowly in the Job’s current run. The data engineer asks a tech lead for help in identifying why this might be the case.

Which approach can the tech lead use to identify why the notebook is running slowly as part of the Job?

Choices

  • A: They can navigate to the Runs tab in the Jobs UI to immediately review the processing notebook.
  • B: They can navigate to the Tasks tab in the Jobs UI and click on the active run to review the processing notebook.
  • C: They can navigate to the Runs tab in the Jobs UI and click on the active run to review the processing notebook.
  • D: They can navigate to the Tasks tab in the Jobs UI to immediately review the processing notebook.

Question zU0SrUhh0MdpRvtWVmW6

Question

Which of the following commands will return the location of database customer360?

Choices

  • A: DESCRIBE LOCATION customer360;
  • B: DROP DATABASE customer360;
  • C: DESCRIBE DATABASE customer360;
  • D: ALTER DATABASE customer360 SET DBPROPERTIES (‘location’ = ‘/user’};
  • E: USE DATABASE customer360;