Summer Special 60% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: bestdeal

Free Databricks Databricks-Certified-Data-Engineer-Associate Practice Exam with Questions & Answers

Questions 1

Which method should a Data Engineer apply to ensure Workflows are being triggered on schedule?

Options:
A.

Scheduled Workflows require an always-running cluster, which is more expensive but reduces processing latency.

B.

Scheduled Workflows process data as it arrives at configured sources.

C.

Scheduled Workflows can reduce resource consumption and expense since the cluster runs only long enough to execute the pipeline.

D.

Scheduled Workflows run continuously until manually stopped.

Questions 2

Which of the following describes a scenario in which a data engineer will want to use a single-node cluster?

Options:
A.

When they are working interactively with a small amount of data

B.

When they are running automated reports to be refreshed as quickly as possible

C.

When they are working with SQL within Databricks SQL

D.

When they are concerned about the ability to automatically scale with larger data

E.

When they are manually running reports with a large amount of data

Questions 3

Which of the following approaches should be used to send the Databricks Job owner an email in the case that the Job fails?

Options:
A.

Manually programming in an alert system in each cell of the Notebook

B.

Setting up an Alert in the Job page

C.

Setting up an Alert in the Notebook

D.

There is no way to notify the Job owner in the case of Job failure

E.

MLflow Model Registry Webhooks

Questions 4

Which tool is used by Auto Loader to process data incrementally?

Options:
A.

Spark Structured Streaming

B.

Unity Catalog

C.

Checkpointing

D.

Databricks SQL

Questions 5

Which SQL keyword can be used to convert a table from a long format to a wide format?

Options:
A.

TRANSFORM

B.

PIVOT

C.

SUM

D.

CONVERT

Questions 6

Which of the following commands can be used to write data into a Delta table while avoiding the writing of duplicate records?

Options:
A.

DROP

B.

IGNORE

C.

MERGE

D.

APPEND

E.

INSERT

Questions 7

A data engineer wants to create a data entity from a couple of tables. The data entity must be used by other data engineers in other sessions. It also must be saved to a physical location.

Which of the following data entities should the data engineer create?

Options:
A.

Database

B.

Function

C.

View

D.

Temporary view

E.

Table

Questions 8

Which of the following must be specified when creating a new Delta Live Tables pipeline?

Options:
A.

A key-value pair configuration

B.

The preferred DBU/hour cost

C.

A path to cloud storage location for the written data

D.

A location of a target database for the written data

E.

At least one notebook library to be executed

Questions 9

A data engineer is designing a data pipeline. The source system generates files in a shared directory that is also used by other processes. As a result, the files should be kept as is and will accumulate in the directory. The data engineer needs to identify which files are new since the previous run in the pipeline, and set up the pipeline to only ingest those new files with each run.

Which of the following tools can the data engineer use to solve this problem?

Options:
A.

Unity Catalog

B.

Delta Lake

C.

Databricks SQL

D.

Data Explorer

E.

Auto Loader

Questions 10

Which of the following describes the type of workloads that are always compatible with Auto Loader?

Options:
A.

Dashboard workloads

B.

Streaming workloads

C.

Machine learning workloads

D.

Serverless workloads

E.

Batch workloads