Questions and Answers

Question lU4e7bafmAstjStV9FrR

Question

In which of the following scenarios should a data engineer select a Task in the Depends On field of a new Databricks Job Task?

Choices

  • A: When another task needs to be replaced by the new task
  • B: When another task needs to successfully complete before the new task begins
  • C: When another task has the same dependency libraries as the new task
  • D: When another task needs to use as little compute resources as possible

Question sYl3h2Cr2ujyxxkkfN5s

Question

A data engineering team has two tables. The first table march_transactions is a collection of all retail transactions in the month of March. The second table april_transactions is a collection of all retail transactions in the month of April. There are no duplicate records between the tables.

Which of the following commands should be run to create a new table all_transactions that contains all records from march_transactions and april_transactions without duplicate records?

Choices

  • A: CREATE TABLE all_transactions AS SELECT * FROM march_transactions INNER JOIN SELECT * FROM april_transactions;
  • B: CREATE TABLE all_transactions AS SELECT * FROM march_transactions UNION SELECT * FROM april_transactions;
  • C: CREATE TABLE all_transactions AS SELECT * FROM march_transactions OUTER JOIN SELECT * FROM april_transactions;
  • D: CREATE TABLE all_transactions AS SELECT * FROM march_transactions INTERSECT SELECT * from april_transactions;

Question uEuKrTAjpsA7mk0S7j50

Question

How can Git operations must be performed outside of Databricks Repos?

Choices

  • A: Commit
  • B: Pull
  • C: Merge
  • D: Clone

Question 5eD4YA2cjOIkGu15ValL

Question

A data engineer has joined an existing project and they see the following query in the project repository:

CREATE STREAMING LIVE TABLE loyal_customers AS

SELECT customer_id - FROM STREAM(LIVE.customers) WHERE loyalty_level = ‘high’;

Which of the following describes why the STREAM function is included in the query?

Choices

  • A: The STREAM function is not needed and will cause an error.
  • B: The data in the customers table has been updated since its last run.
  • C: The customers table is a streaming live table.
  • D: The customers table is a reference to a Structured Streaming query on a PySpark DataFrame.

Question LSkeCEdhvOTL6gRpOcKL

Question

Which Structured Streaming query is performing a hop from a Silver table to a Gold table?

Choices

  • A:
  • B:
  • C:
  • D: