Questions and Answers

Question WGaFObuXnaOgyZiQDkyF

Question

A data engineer needs to access a view created by the sales team, using a shared cluster. The data engineer has already been granted USAGE permissions on the catalog and schema.

What are the minimum additional permissions the data engineer requires to access the view?

Choices

  • A: Needs SELECT permission on the VIEW and the underlying TABLE.
  • B: Needs SELECT permission only on the VIEW
  • C: Needs ALL PRIVILEGES on the VIEW
  • D: Needs ALL PRIVILEGES at the SCHEMA level
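
For reference, a minimal sketch of how read access on a view might be granted and inspected in Databricks SQL, assuming Unity Catalog. The catalog, schema, view, and principal names below are hypothetical placeholders and do not come from the question.

```sql
-- Hypothetical names: sales_catalog, sales, regional_summary, data.engineer@example.com.
-- USAGE on the catalog and schema is assumed to be in place already, as the question states.

-- Grant read access on the view itself.
GRANT SELECT ON VIEW sales_catalog.sales.regional_summary
  TO `data.engineer@example.com`;

-- Inspect which privileges are currently granted on the view.
SHOW GRANTS ON VIEW sales_catalog.sales.regional_summary;
```

Whether a grant on the underlying table is also needed depends on the governance model and the cluster access mode, which is the point the question is probing.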

Question tyYSr1qhAOFTqEmvLaab

Question

Which method should a Data Engineer apply to ensure Workflows are being triggered on schedule?

Choices

  • A: Scheduled Workflows require an always-running cluster, which is more expensive but reduces processing latency.
  • B: Scheduled Workflows process data as it arrives at configured sources.
  • C: Scheduled Workflows can reduce resource consumption and expense since the cluster runs only long enough to execute the pipeline.
  • D: Scheduled Workflows run continuously until manually stopped.

Question VQpX3T8qMmmaCb2DwIzv

Question

The Delta transaction log for the ‘students’ table is shown using the ‘DESCRIBE HISTORY students’ command. A Data Engineer needs to query the table as it existed before the UPDATE operation listed in the log.

Which commands should the Data Engineer use to achieve this? (Choose two.)

[Image: output of DESCRIBE HISTORY students]

Choices

  • A: SELECT * FROM students@v4
  • B: SELECT * FROM students TIMESTAMP AS OF '2024-04-22T14:32:47.000+00:00'
  • C: SELECT * FROM students FROM HISTORY VERSION AS OF 3
  • D: SELECT * FROM students VERSION AS OF 5
  • E: SELECT * FROM students TIMESTAMP AS OF '2024-04-22T14:32:58.000+00:00'
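
As a reminder of the general syntax, here is a short sketch of the Delta Lake time travel forms. The history image is not reproduced here, so the version number and timestamp below are hypothetical placeholders, not the values the question expects.

```sql
-- List the table's versions, timestamps, and operations.
DESCRIBE HISTORY students;

-- Query by version number (2 is a placeholder).
SELECT * FROM students VERSION AS OF 2;

-- Shorthand for the same version-based query.
SELECT * FROM students@v2;

-- Query by timestamp (placeholder value); this resolves to the latest
-- version committed at or before the given time.
SELECT * FROM students TIMESTAMP AS OF '2024-01-01T00:00:00.000+00:00';
```

To read the table as it was before a given operation, the version or timestamp has to point to the commit immediately preceding that operation in the history output.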

Question Me4u56mwCkH5izMPyEgE

Question

An engineering manager uses a Databricks SQL query to monitor ingestion latency for each data source. The manager checks the results every day, but does so by manually rerunning the query and waiting for it to finish.

Which of the following approaches can the manager use to ensure the results of the query are updated each day?

Choices

  • A: They can schedule the query to refresh every 1 day from the SQL endpoint’s page in Databricks SQL.
  • B: They can schedule the query to refresh every 12 hours from the SQL endpoint’s page in Databricks SQL.
  • C: They can schedule the query to refresh every 1 day from the query’s page in Databricks SQL.
  • D: They can schedule the query to run every 12 hours from the Jobs UI.

Question ILDQ79FBUW7OY6aWyErm

Question

Which of the following benefits is provided by the array functions from Spark SQL?

Choices

  • A: An ability to work with data in a variety of types at once
  • B: An ability to work with data within certain partitions and windows
  • C: An ability to work with time-related data in specified intervals
  • D: An ability to work with complex, nested data ingested from JSON files
  • E: An ability to work with an array of tables for procedural automation
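
To illustrate what the array functions look like in practice, here is a small sketch in Spark SQL. The JSON literal, the column names, and the `raw` CTE are made up for the example; only the functions themselves (`from_json`, `size`, `array_contains`, `transform`, `explode`) are built into Spark SQL.

```sql
-- Parse a hypothetical JSON record containing a nested array.
WITH raw AS (
  SELECT from_json(
           '{"student": "a", "scores": [90, 72, 85]}',
           'student STRING, scores ARRAY<INT>'
         ) AS rec
)
SELECT
  rec.student,
  size(rec.scores)                    AS n_scores,       -- number of array elements
  array_contains(rec.scores, 85)      AS has_85,         -- membership test
  transform(rec.scores, s -> s + 5)   AS curved_scores   -- apply a lambda to each element
FROM raw;

-- Flatten the array into one row per element.
WITH raw AS (
  SELECT from_json(
           '{"student": "a", "scores": [90, 72, 85]}',
           'student STRING, scores ARRAY<INT>'
         ) AS rec
)
SELECT rec.student, explode(rec.scores) AS score
FROM raw;
```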