Questions and Answers

Question ftej0T8v21cl0vbZJlho

Question

Which of the following data workloads will utilize a Gold table as its source?

Choices

  • A: A job that enriches data by parsing its timestamps into a human-readable format
  • B: A job that aggregates uncleaned data to create standard summary statistics
  • C: A job that cleans data by removing malformatted records
  • D: A job that queries aggregated data designed to feed into a dashboard
  • E: A job that ingests raw data from a streaming source into the Lakehouse

Question QMoHFSJDErgLvwwLIXCo

Question

Which of the following must be specified when creating a new Delta Live Tables pipeline?

Choices

  • A: A key-value pair configuration
  • B: The preferred DBU/hour cost
  • C: A path to cloud storage location for the written data
  • D: A location of a target database for the written data
  • E: At least one notebook library to be executed

Question OCBCr8xvRlaCLC2wa5JC

Question

A data engineer has joined an existing project and they see the following query in the project repository:

CREATE STREAMING LIVE TABLE loyal_customers AS

SELECT customer_id - FROM STREAM(LIVE.customers) WHERE loyalty_level = ‘high’;

Which of the following describes why the STREAM function is included in the query?

Choices

  • A: The STREAM function is not needed and will cause an error.
  • B: The table being created is a live table.
  • C: The customers table is a streaming live table.
  • D: The customers table is a reference to a Structured Streaming query on a PySpark DataFrame.
  • E: The data in the customers table has been updated since its last run.

Question odLBIcB5zh1f1qjn4hg6

Question

Which of the following describes the type of workloads that are always compatible with Auto Loader?

Choices

  • A: Streaming workloads
  • B: Machine learning workloads
  • C: Serverless workloads
  • D: Batch workloads
  • E: Dashboard workloads

Question FtYkJMMCmPtaoWBSLOot

Question

A data engineer and data analyst are working together on a data pipeline. The data engineer is working on the raw, bronze, and silver layers of the pipeline using Python, and the data analyst is working on the gold layer of the pipeline using SQL. The raw source of the pipeline is a streaming input. They now want to migrate their pipeline to use Delta Live Tables.

Which of the following changes will need to be made to the pipeline when migrating to Delta Live Tables?

Choices

  • A: None of these changes will need to be made
  • B: The pipeline will need to stop using the medallion-based multi-hop architecture
  • C: The pipeline will need to be written entirely in SQL
  • D: The pipeline will need to use a batch source in place of a streaming source
  • E: The pipeline will need to be written entirely in Python