Questions and Answers
Question DECn5urw7a7L6fAegpBb
Question
A Delta Live Table pipeline includes two datasets defined using STREAMING LIVE TABLE. Three datasets are defined against Delta Lake table sources using LIVE TABLE.
The table is configured to run in Production mode using the Continuous Pipeline Mode.
What is the expected outcome after clicking Start to update the pipeline assuming previously unprocessed data exists and all definitions are valid?
Choices
- A: All datasets will be updated at set intervals until the pipeline is shut down. The compute resources will persist to allow for additional testing.
- B: All datasets will be updated once and the pipeline will shut down. The compute resources will persist to allow for additional testing.
- C: All datasets will be updated at set intervals until the pipeline is shut down. The compute resources will be deployed for the update and terminated when the pipeline is stopped.
- D: All datasets will be updated once and the pipeline will shut down. The compute resources will be terminated.
answer?
Answer: C Answer_ET: C Community answer C (100%) Discussion
Comment 1327319 by MultiCloudIronMan
- Upvotes: 1
Selected Answer: C Continuous Pipeline mode
Comment 1285885 by CommanderBigMac
- Upvotes: 2
Selected Answer: C Continuous Pipeline Mode in Production mode implies that the pipeline continuously processes incoming data updates at set intervals, ensuring the datasets are kept up-to-date as new data arrives. Since the pipeline is set to Continuous Pipeline Mode, it will keep running and updating the datasets until it is manually shut down. The compute resources are allocated dynamically to process and update the datasets as needed, and they will be terminated when the pipeline is stopped or shut down. This mode allows for real-time or near-real-time updates to the datasets from the streaming/live tables, ensuring that the data remains current and reflects the changes occurring in the data sources.
Question htfxurHt4TGUlXuhaaps
Question
Which type of workloads are compatible with Auto Loader?
Choices
- A: Streaming workloads
- B: Machine learning workloads
- C: Serverless workloads
- D: Batch workloads
answer?
Answer: A Answer_ET: A Community answer A (67%) D (33%) Discussion
Comment 1353833 by Geera
- Upvotes: 1
Selected Answer: D A and D. Autoloader can also be used for batch processing by triggering once, so Autoloader is compatible with both streaming and batch workloads.
Comment 1327321 by MultiCloudIronMan
- Upvotes: 1
Selected Answer: A Near realtime data feed
Comment 1299603 by RandomForest
- Upvotes: 1
Selected Answer: A Auto Loader is designed to ingest data continuously from data lakes and process it in real-time. It can efficiently handle streaming data by detecting and processing new files as they arrive.
Question SVViXkByhOZNTqBjXUS2
Question
A data engineer has developed a data pipeline to ingest data from a JSON source using Auto Loader, but the engineer has not provided any type inference or schema hints in their pipeline. Upon reviewing the data, the data engineer has noticed that all of the columns in the target table are of the string type despite some of the fields only including float or boolean values.
Why has Auto Loader inferred all of the columns to be of the string type?
Choices
- A: Auto Loader cannot infer the schema of ingested data
- B: JSON data is a text-based format
- C: Auto Loader only works with string data
- D: All of the fields had at least one null value
answer?
Answer: B Answer_ET: B Community answer B (100%) Discussion
Comment 1299605 by RandomForest
- Upvotes: 1
Selected Answer: B The correct answer is B as JSON files do not include datatypes
Comment 1250795 by csrazdan
- Upvotes: 1
Selected Answer: B JSON file does not include datatype and all columns are defaulted as string
Question ojTkuxlIjHIclnPib6lQ
Question
Which statement regarding the relationship between Silver tables and Bronze tables is always true?
Choices
- A: Silver tables contain a less refined, less clean view of data than Bronze data.
- B: Silver tables contain aggregates while Bronze data is unaggregated.
- C: Silver tables contain more data than Bronze tables.
- D: Silver tables contain less data than Bronze tables.
answer?
Answer: D Answer_ET: D Community answer D (100%) Discussion
Comment 1209375 by Ivan_Petrov
- Upvotes: 11
looks like there is no correct answer. Shold be like A but Silver and Bronze should be changed in their places
Comment 1282725 by CommanderBigMac
- Upvotes: 1
Selected Answer: D Silver table is the step of reducing and refining data after raw data was imported into bronze table. Gold table would be the last step where data aggregation is applied.
Comment 1270629 by 22d4d76
- Upvotes: 1
Cannot be D. If raw data are clean enough already and have no aggregations or other functions before to be in the silver table, there will be the same amount of data both side. A seems to be a better answer because from raw you will have more chance to make at least one transformation to refine data.
Comment 1267943 by 80370eb
- Upvotes: 1
Selected Answer: D D. Silver tables contain less data than Bronze tables.
This is because Silver tables are typically more refined and processed versions of the raw data found in Bronze tables. Bronze tables often contain raw, unprocessed data, while Silver tables contain cleaned, filtered, or aggregated data. Therefore, Silver tables usually contain a subset or a refined version of the data in the Bronze tables, leading to less overall data in the Silver tables.
Comment 1236565 by vigaro
- Upvotes: 2
Selected Answer: D silver have some data filters
Comment 1226140 by mgari
- Upvotes: 2
in my opinion it is D Silver has only the data with no error
Comment 1216000 by b79962e
- Upvotes: 2
I think there is no correct answer
Comment 1215990 by PreranaC
- Upvotes: 2
Silver tables contain a more refined and cleaner view of data than Bronze tables.
Comment 1209100 by helmerpaiva
- Upvotes: 1
Correct is A
Question 3reS8fKvK3eG1CD4KqGk
Question
Which query is performing a streaming hop from raw data to a Bronze table?
Choices
- A:
- B:
- C:
- D:
answer?
Answer: D Answer_ET: D Community answer D (100%) Discussion
Comment 1306475 by Pankaj_Shet
- Upvotes: 1
Correct Answer is C. A and B are the aggregations There is no transformation in D.
Comment 1299607 by RandomForest
- Upvotes: 2
Selected Answer: D Correct answer is D.
Comment 1282727 by CommanderBigMac
- Upvotes: 2
Selected Answer: D Question specified streaming hop. D input from raw and is writeStream.