Free Databricks Databricks-Certified-Data-Engineer-Associate Practice Exam with Questions & Answers | Set: 3

Questions 21

A data engineer has been provided a PySpark DataFrame named df with columns product and revenue. The data engineer needs to compute complex aggregations to determine each product's total revenue, average revenue, and transaction count.

Which code snippet should the data engineer use?

A) [image: code snippet, not reproduced]

B) [image: code snippet, not reproduced]

C) [image: code snippet, not reproduced]

D) [image: code snippet, not reproduced]

Options:
A.

Option A

B.

Option B

C.

Option C

D.

Option D

Questions 22

A new data engineering team, team, has been assigned to an ELT project. The new data engineering team will need full privileges on the table sales to fully manage the project.

Which of the following commands can be used to grant full permissions on the table to the new data engineering team?

Options:
A.

GRANT ALL PRIVILEGES ON TABLE sales TO team;

B.

GRANT SELECT CREATE MODIFY ON TABLE sales TO team;

C.

GRANT SELECT ON TABLE sales TO team;

D.

GRANT USAGE ON TABLE sales TO team;

E.

GRANT ALL PRIVILEGES ON TABLE team TO sales;

Questions 23

Which of the following describes a scenario in which a data team will want to utilize cluster pools?

Options:
A.

An automated report needs to be refreshed as quickly as possible.

B.

An automated report needs to be made reproducible.

C.

An automated report needs to be tested to identify errors.

D.

An automated report needs to be version-controlled across multiple collaborators.

E.

An automated report needs to be runnable by all stakeholders.

Questions 24

Which SQL keyword can be used to convert a table from a long format to a wide format?

Options:
A.

TRANSFORM

B.

PIVOT

C.

SUM

D.

CONVERT
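
For readers unsure what "long format" and "wide format" mean here, the following is a minimal sketch of the reshape itself, not of any Databricks syntax. It uses SQLite only because it ships with Python; SQLite has no dedicated reshaping keyword, so the sketch falls back on conditional aggregation. The table name sales_long, the columns product, quarter and revenue, and the helper long_to_wide are all hypothetical and chosen only for illustration.

```python
import sqlite3


def long_to_wide(rows):
    """Turn (product, quarter, revenue) rows into one row per product.

    Hypothetical helper: the long table has one row per product and quarter,
    the wide result has one column per quarter (q1, q2).
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE sales_long (product TEXT, quarter TEXT, revenue INTEGER)"
    )
    conn.executemany("INSERT INTO sales_long VALUES (?, ?, ?)", rows)
    # Conditional aggregation: each CASE picks out the values of one quarter,
    # and GROUP BY collapses the long rows into a single row per product.
    wide = conn.execute(
        """
        SELECT product,
               SUM(CASE WHEN quarter = 'Q1' THEN revenue ELSE 0 END) AS q1,
               SUM(CASE WHEN quarter = 'Q2' THEN revenue ELSE 0 END) AS q2
        FROM sales_long
        GROUP BY product
        ORDER BY product
        """
    ).fetchall()
    conn.close()
    return wide


# Long format: four rows, one per (product, quarter) pair.
rows = [
    ("apple", "Q1", 10),
    ("apple", "Q2", 20),
    ("pear", "Q1", 5),
    ("pear", "Q2", 7),
]
# Wide format: two rows, one per product, with a column per quarter.
print(long_to_wide(rows))  # [('apple', 10, 20), ('pear', 5, 7)]
```

The four input rows become two output rows, so the quarter values move from rows into columns. A product with no rows for a quarter gets 0 in that column under this sketch's ELSE 0 choice.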

Questions 25

What Databricks feature can be used to check the data sources and tables used in a workspace?

Options:
A.

Do not use the lineage feature as it only tracks activity from the last 3 months and will not provide full details on dependencies.

B.

Use the lineage feature to visualize a graph that highlights where the table is used only in notebooks.

C.

Use the lineage feature to visualize a graph that highlights where the table is used only in reports.

D.

Use the lineage feature to visualize a graph that shows all dependencies, including where the table is used in notebooks, other tables, and reports.

Questions 26

A data engineer is attempting to drop a Spark SQL table my_table. The data engineer wants to delete all table metadata and data.

They run the following command:

DROP TABLE IF EXISTS my_table

While the object no longer appears when they run SHOW TABLES, the data files still exist.

Which of the following describes why the data files still exist and the metadata files were deleted?

Options:
A.

The table’s data was larger than 10 GB

B.

The table’s data was smaller than 10 GB

C.

The table was external

D.

The table did not have a location

E.

The table was managed

Questions 27

A data engineer needs to conduct Exploratory Data Analysis (EDA) on data residing in a database within the company’s custom-defined cloud network. The data engineer is using SQL for this task.

Which type of SQL Warehouse will enable the data engineer to process large numbers of queries quickly and cost-effectively?

Options:
A.

All-purpose compute cluster

B.

Pro SQL Warehouse

C.

SQL Serverless Warehouse

D.

Classic SQL Warehouse

Questions 28

Which of the following statements regarding the relationship between Silver tables and Bronze tables is always true?

Options:
A.

Silver tables contain a less refined, less clean view of data than Bronze data.

B.

Silver tables contain aggregates while Bronze data is unaggregated.

C.

Silver tables contain more data than Bronze tables.

D.

Silver tables contain a more refined and cleaner view of data than Bronze tables.

E.

Silver tables contain less data than Bronze tables.

Questions 29

Which of the following commands will return the location of database customer360?

Options:
A.

DESCRIBE LOCATION customer360;

B.

DROP DATABASE customer360;

C.

DESCRIBE DATABASE customer360;

D.

ALTER DATABASE customer360 SET DBPROPERTIES ('location' = '/user');

E.

USE DATABASE customer360;

Questions 30

A data engineer is using the following code block as part of a batch ingestion pipeline to read from a composable table:

[image: code block, not reproduced]

Which of the following changes needs to be made so this code block will work when the transactions table is a stream source?

Options:
A.

Replace predict with a stream-friendly prediction function

B.

Replace schema(schema) with option("maxFilesPerTrigger", 1)

C.

Replace "transactions" with the path to the location of the Delta table

D.

Replace format("delta") with format("stream")

E.

Replace spark.read with spark.readStream