Questions and Answers

Question 8ly8s8V6CLa7IbV5CAYi

Question

A data engineer needs to create a table in Databricks using data from a CSV file at location /path/to/csv.

They run the following command:

//IMG//

Which of the following lines of code fills in the above blank to successfully complete the task?

Choices

  • A: None of these lines of code are needed to successfully complete the task
  • B: USING CSV
  • C: FROM CSV
  • D: USING DELTA
  • E: FROM "path/to/csv"

Question S9PIxGSsSLWrOOBQqUfT

Question

A data engineer has realized that they made a mistake when making a daily update to a table. They need to use Delta time travel to restore the table to a version that is 3 days old. However, when the data engineer attempts to time travel to the older version, they are unable to restore the data because the data files have been deleted. Which of the following explains why the data files are no longer present?

Choices

  • A: The VACUUM command was run on the table
  • B: The TIME TRAVEL command was run on the table
  • C: The DELETE HISTORY command was run on the table
  • D: The OPTIMIZE command was run on the table
  • E: The HISTORY command was run on the table

Question CesFh273ckQ2F8rdN7u6

Question

A data engineer has configured a Structured Streaming job to read from a table, manipulate the data, and then perform a streaming write into a new table.

The code block used by the data engineer is below:

//IMG//

If the data engineer only wants the query to process all of the available data in as many batches as required, which of the following lines of code should the data engineer use to fill in the blank?

Choices

  • A: processingTime(1)
  • B: trigger(availableNow=True)
  • C: trigger(parallelBatch=True)
  • D: trigger(processingTime="once")
  • E: trigger(continuous="once")

Question P9uirovIcYtBaOoaKUhD

Question

A data engineer has developed a data pipeline to ingest data from a JSON source using Auto Loader, but the engineer has not provided any type inference or schema hints in their pipeline. Upon reviewing the data, the data engineer has noticed that all of the columns in the target table are of the string type despite some of the fields only including float or boolean values.

Which of the following describes why Auto Loader inferred all of the columns to be of the string type?

Choices

  • A: There was a type mismatch between the specific schema and the inferred schema
  • B: JSON data is a text-based format
  • C: Auto Loader only works with string data
  • D: All of the fields had at least one null value
  • E: Auto Loader cannot infer the schema of ingested data

Question d1ZWko5b9hIrr1TAQDDz

Question

A Delta Live Tables pipeline includes two datasets defined using STREAMING LIVE TABLE. Three datasets are defined against Delta Lake table sources using LIVE TABLE.

The pipeline is configured to run in Development mode using the Continuous Pipeline Mode.

Assuming previously unprocessed data exists and all definitions are valid, what is the expected outcome after clicking Start to update the pipeline?

Choices

  • A: All datasets will be updated once and the pipeline will shut down. The compute resources will be terminated.
  • B: All datasets will be updated at set intervals until the pipeline is shut down. The compute resources will persist until the pipeline is shut down.
  • C: All datasets will be updated once and the pipeline will persist without any processing. The compute resources will persist but go unused.
  • D: All datasets will be updated once and the pipeline will shut down. The compute resources will persist to allow for additional testing.
  • E: All datasets will be updated at set intervals until the pipeline is shut down. The compute resources will persist to allow for additional testing.