Summer Special 60% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: bestdeal

Free EMC D-DS-FN-23 Practice Exam with Questions & Answers

Questions 1

A logistic regression model is built to determine the probability of a credit card borrower defaulting on a credit loan. A threshold value of 0.3 is selected. Which statement can be used to predict a borrower will default?

Options:
A.

If probability > 0.1, then predict the borrower will default

B.

If probability < 0.1, then predict the borrower will default

C.

If probability > 0.3, then predict the borrower will default

D.

If probability < 0.3, then predict the borrower will default

EMC D-DS-FN-23 Premium Access
Questions 2

Which SQL set operator returns rows that exist in the first SELECT statement answer set but not in the second SELECT statement?

Options:
A.

EXCEPT

B.

UNION

C.

UNION ALL

D.

INTERSECT

Questions 3

In hypothesis testing, when does a Type I error occur?

Options:
A.

Null hypothesis is rejected when it is actually false

B.

Null hypothesis is rejected when it is actually true

C.

Null hypothesis is accepted when it is actually false

D.

Null hypothesis is accepted when it is actually true

Questions 4

Which SQL OLAP grouping extension returns a result for each output row with 1 identifying a summary row and 0 identifying grouped rows?

Options:
A.

CUBE

B.

GROUPING

C.

GROUP ID

D.

ROLLUP

Questions 5

After which phase of the data analytics lifecycle should you determine if the model needs any recalibration?

Options:
A.

Model planning

B.

Data preparation

C.

Discovery

D.

Operationalize

Questions 6

What is a key consideration when preparing a presentation intended for analysts?

Options:
A.

Describe how to implement the model

B.

Provide talking points to promote or evangelize the project

C.

Emphasize the business benefits of implementing the model

D.

Focus on clean simple-to-understand visuals

Questions 7

D-DS-FN-23 Question 7

Refer to the exhibit.

What is the approximate R-squared value for a linear regression model fitted to the data associated with this scatterplot?

Options:
A.

4

B.

0.96

C.

0.25

D.

16

Questions 8

What characterizes the Hadoop Distributed File System?

Options:
A.

Peer to peer system designed to run on custom designed hardware

B.

Peer to peer system designed to run on commodity hardware

C.

Master/ slave system designed to run on custom designed hardware

D.

Master/ slave system designed to run on commodity hardware

Questions 9

D-DS-FN-23 Question 9

Refer to the exhibit.

To predict whether or not a customer will renew their annual property insurance policy, an insurance company built and operationalized a naïve Bayes classification model. In the model, there are two class labels, renewal and non-renewal, that are assigned to each customer based on their attributes.

A subset of the key attributes, their values, and corresponding conditional probabilities are provided in the exhibit.

A customer has the following attributes:

● Age is greater than 65 years

● Owns their own home

● Renewal month is August

If 20% of customers do not renew the police every year, what is the score for a renewal in the naïve Bayesian model for the customer described above?

Options:
A.

0.0022

B.

0 0027

C.

0.0270

D.

0.0216

Questions 10

In time series analysis, what statement describes a MA(q) process?

Options:
A.

Current deviation from the time series mean depends on the q previous deviations

B.

Current deviation from the time series mean depends on the quotient q

C.

Current time series value depends on the q previous values

D.

Current time series value depends on the fitted polynomial of order q