Questions and Answers

Question rnFB2GLwmLx1aziPGrhl

Question

A data engineer runs a statement every day to copy the previous day’s sales into the table transactions. Each day’s sales are in their own file in the location “/transactions/raw”.

Today, the data engineer runs the following command to complete this task:

//IMG//

After running the command today, the data engineer notices that the number of records in table transactions has not changed.

What explains why the statement might not have copied any new records into the table?

Choices

  • A: The format of the files to be copied was not included with the FORMAT_OPTIONS keyword.
  • B: The COPY INTO statement requires the table to be refreshed to view the copied rows.
  • C: The previous day’s file has already been copied into the table.
  • D: The PARQUET file format does not support COPY INTO.
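
Background for this scenario: Databricks documents COPY INTO as a retriable, idempotent operation, meaning files in the source location that were already loaded are skipped on later runs. The block below is a minimal plain-Python sketch of that bookkeeping. It is a toy model, not Databricks code, and the names `ToyTable`, `copy_into`, and the file name are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class ToyTable:
    """Toy stand-in for a table that remembers which source files it has loaded."""
    rows: list = field(default_factory=list)
    loaded_files: set = field(default_factory=set)


def copy_into(table, source_files):
    """Load only files not seen before; return the number of rows added."""
    added = 0
    for name, records in source_files.items():
        if name in table.loaded_files:
            continue  # already ingested on an earlier run, so skip it
        table.rows.extend(records)
        table.loaded_files.add(name)
        added += len(records)
    return added


transactions = ToyTable()
raw = {"2024-01-01.parquet": [{"id": 1}, {"id": 2}]}  # hypothetical file

first_run = copy_into(transactions, raw)   # loads both records
second_run = copy_into(transactions, raw)  # same file again: nothing new
```

In this model, rerunning the load against an unchanged source leaves the row count untouched, which is the behavior the engineer observes.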

Question DxPV277yJhl6Y3TORBBz

Question

Which command can be used to write data into a Delta table while avoiding the writing of duplicate records?

Choices

  • A: DROP
  • B: INSERT
  • C: MERGE
  • D: APPEND
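
For context, Delta Lake's `MERGE INTO ... USING ... ON ...` statement matches source rows against target rows on a condition and can update matches while inserting only the unmatched rows. A rough plain-Python sketch of that upsert logic follows. It assumes a single key column; the `merge` helper and the sample rows are hypothetical and only illustrate the idea.

```python
def merge(target, source, key):
    """Upsert source rows into target, matching rows on `key`."""
    # Map each key value already in the target to its position.
    index = {row[key]: i for i, row in enumerate(target)}
    for row in source:
        if row[key] in index:
            target[index[row[key]]] = row   # matched: update in place
        else:
            index[row[key]] = len(target)
            target.append(row)              # not matched: insert
    return target


target = [{"id": 1, "amount": 10}]
source = [{"id": 1, "amount": 10}, {"id": 2, "amount": 25}]  # id 1 already exists

merged = merge(target, source, "id")
```

A plain append of `source` would leave two rows with `id` 1; the key match is what prevents the duplicate.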

Question vEQRkhP4UsW0N3NCPPuQ

Question

A data analyst has created a Delta table sales that is used by the entire data analysis team. They want help from the data engineering team to implement a series of tests to ensure the data is clean. However, the data engineering team uses Python for its tests rather than SQL.

Which command could the data engineering team use to access sales in PySpark?

Choices

  • A: SELECT * FROM sales
  • B: spark.table("sales")
  • C: spark.sql("sales")
  • D: spark.delta.table("sales")

Question 0rGo35Q7LYULQH6s3bEg

Question

A data engineer has created a new database using the following command:

CREATE DATABASE IF NOT EXISTS customer360;

In which location will the customer360 database be located?

Choices

  • A: dbfs:/user/hive/database/customer360
  • B: dbfs:/user/hive/warehouse
  • C: dbfs:/user/hive/customer360
  • D: dbfs:/user/hive/database

Question CdXnunYEVub9xYMp0KCQ

Question

A data engineer is attempting to drop a Spark SQL table my_table and runs the following command:

DROP TABLE IF EXISTS my_table;

After running this command, the engineer notices that the data files and metadata files have been deleted from the file system.

What is the reason behind the deletion of all these files?

Choices

  • A: The table was managed
  • B: The table’s data was smaller than 10 GB
  • C: The table did not have a location
  • D: The table was external
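
As background, Spark SQL distinguishes managed tables, whose data files are owned by the metastore, from external (unmanaged) tables, where only the metadata is tracked. The following plain-Python sketch models how DROP TABLE treats each kind. It is a toy model under those assumptions; the `catalog` and `storage` dictionaries, the `drop_table` helper, and the paths are all hypothetical.

```python
# Toy metastore: table name -> whether it is managed, and where its data lives.
catalog = {
    "managed_tbl": {"managed": True, "path": "/warehouse/managed_tbl"},
    "external_tbl": {"managed": False, "path": "/mnt/ext/external_tbl"},
}

# Toy file system: path -> data files stored there.
storage = {
    "/warehouse/managed_tbl": ["part-0"],
    "/mnt/ext/external_tbl": ["part-0"],
}


def drop_table(name):
    """Model DROP TABLE IF EXISTS: always drop metadata, drop data only if managed."""
    entry = catalog.pop(name, None)  # IF EXISTS: a missing table is not an error
    if entry is None:
        return
    if entry["managed"]:
        storage.pop(entry["path"], None)  # managed: data files go too


drop_table("managed_tbl")
drop_table("external_tbl")
```

After both calls the catalog is empty, but only the external table's files remain in `storage`.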