Questions and Answers

Question P4gq703E7glyK9hJZhpC

Question

A data engineer has a single-task Job that runs each morning before they begin working. After identifying an upstream data issue, they need to set up another task to run a new notebook prior to the original task. Which of the following approaches can the data engineer use to set up the new task?

Choices

  • A: They can clone the existing task in the existing Job and update it to run the new notebook.
  • B: They can create a new task in the existing Job and then add it as a dependency of the original task.
  • C: They can create a new task in the existing Job and then add the original task as a dependency of the new task.
  • D: They can create a new job from scratch and add both tasks to run concurrently.
  • E: They can clone the existing task to a new Job and then edit it to run the new notebook.
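For background on how task ordering is expressed, here is a minimal sketch assuming a Jobs API 2.1 style settings payload, where each task lists the tasks it must wait for under `depends_on`. The job name, task keys, and notebook paths are hypothetical, and the `run_order` helper is only an illustration of the ordering rule, not part of Databricks.

```python
from graphlib import TopologicalSorter

# Hypothetical job settings: names and notebook paths are illustrative only.
job_settings = {
    "name": "morning-job",
    "tasks": [
        {
            "task_key": "new_task",
            "notebook_task": {"notebook_path": "/Repos/example/new_notebook"},
        },
        {
            "task_key": "original_task",
            # A task starts only after every task listed in depends_on finishes.
            "depends_on": [{"task_key": "new_task"}],
            "notebook_task": {"notebook_path": "/Repos/example/original_notebook"},
        },
    ],
}


def run_order(settings):
    """Return task keys in an order that respects each task's depends_on list."""
    graph = {
        task["task_key"]: {dep["task_key"] for dep in task.get("depends_on", [])}
        for task in settings["tasks"]
    }
    return list(TopologicalSorter(graph).static_order())


print(run_order(job_settings))
```

In this sketch the task that carries the `depends_on` entry runs second, and the task it names runs first.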

Question 6BSLaJ9rZfiTqHJnCqM9

Question

An engineering manager wants to monitor the performance of a recent project using a Databricks SQL query. For the first week following the project’s release, the manager wants the query results to be updated every minute. However, the manager is concerned that the compute resources used for the query will be left running and cost the organization a lot of money beyond the first week of the project’s release. Which of the following approaches can the engineering team use to ensure the query does not cost the organization any money beyond the first week of the project’s release?

Choices

  • A: They can set a limit to the number of DBUs that are consumed by the SQL Endpoint.
  • B: They can set the query’s refresh schedule to end after a certain number of refreshes.
  • C: They cannot ensure the query does not cost the organization money beyond the first week of the project’s release.
  • D: They can set a limit to the number of individuals that are able to manage the query’s refresh schedule.
  • E: They can set the query’s refresh schedule to end on a certain date in the query scheduler.

Question dgP24CtMews8UDvFtvEQ

Question

A data analysis team has noticed that their Databricks SQL queries are running too slowly when connected to their always-on SQL endpoint. They claim that this issue is present when many members of the team are running small queries simultaneously. They ask the data engineering team for help. The data engineering team notices that each of the team’s queries uses the same SQL endpoint. Which of the following approaches can the data engineering team use to improve the latency of the team’s queries?

Choices

  • A: They can increase the cluster size of the SQL endpoint.
  • B: They can increase the maximum bound of the SQL endpoint’s scaling range.
  • C: They can turn on the Auto Stop feature for the SQL endpoint.
  • D: They can turn on the Serverless feature for the SQL endpoint.
  • E: They can turn on the Serverless feature for the SQL endpoint and change the Spot Instance Policy to “Reliability Optimized.”
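To make the concurrency scenario concrete, the following rough sketch models how many simultaneous queries would have to wait in a queue for a given upper bound on the endpoint's scaling range. The figure of 10 concurrent queries per cluster is an assumption chosen for illustration, and `queued_queries` is a hypothetical helper, not a Databricks API.

```python
import math

# Assumed per-cluster concurrency, used purely for illustration.
QUERIES_PER_CLUSTER = 10


def queued_queries(concurrent, max_clusters, per_cluster=QUERIES_PER_CLUSTER):
    """Estimate how many queries wait when the endpoint cannot add more clusters."""
    needed = math.ceil(concurrent / per_cluster)   # clusters needed to run everything at once
    active = min(needed, max_clusters)             # scaling stops at the maximum bound
    return max(0, concurrent - active * per_cluster)


print(queued_queries(35, max_clusters=1))
print(queued_queries(35, max_clusters=4))
```

Under these assumptions, the number of queued queries depends on how many clusters may run in parallel, not on how large each individual cluster is.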

Question uAAwwhxaKBTI8TAsD6ni

Question

Which of the following benefits of using the Databricks Lakehouse Platform is provided by Delta Lake?

Choices

  • A: The ability to manipulate the same data using a variety of languages
  • B: The ability to collaborate in real time on a single notebook
  • C: The ability to set up alerts for query failures
  • D: The ability to support batch and streaming workloads
  • E: The ability to distribute complex data operations

Question T5CAvKlX0QEteNkjSQKF

Question

A data engineer wants to schedule their Databricks SQL dashboard to refresh once per day, but they only want the associated SQL endpoint to be running when it is necessary. Which of the following approaches can the data engineer use to minimize the total running time of the SQL endpoint used in the refresh schedule of their dashboard?

Choices

  • A: They can ensure the dashboard’s SQL endpoint matches each of the queries’ SQL endpoints.
  • B: They can set up the dashboard’s SQL endpoint to be serverless.
  • C: They can turn on the Auto Stop feature for the SQL endpoint.
  • D: They can reduce the cluster size of the SQL endpoint.
  • E: They can ensure the dashboard’s SQL endpoint is not one of the included queries’ SQL endpoints.
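As a back-of-the-envelope illustration of endpoint running time for a once-per-day refresh, the sketch below compares an endpoint that never stops with one that shuts down after an idle timeout. The function name and the minute values are hypothetical, and startup time is ignored for simplicity.

```python
MINUTES_PER_DAY = 24 * 60


def daily_running_minutes(refresh_minutes, auto_stop_minutes=None):
    """Estimate daily endpoint uptime for a single refresh per day.

    auto_stop_minutes=None models an endpoint that is never stopped.
    """
    if auto_stop_minutes is None:
        return MINUTES_PER_DAY
    # The endpoint runs for the refresh, then idles until the timeout elapses.
    return min(MINUTES_PER_DAY, refresh_minutes + auto_stop_minutes)


print(daily_running_minutes(5))
print(daily_running_minutes(5, auto_stop_minutes=10))
```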