You are creating a Classification process where input is the income, education and current debt of a customer, what could be the possible output of this process.
You are working with the Clustering solution of the customer datasets. There are almost 40 variables are available for each customer and almost 1.00,0000 customer's data is available. You want to reduce the number of variables for clustering, what would you do?
Which of the following is not a correct application for the Classification?
Select the correct statement which applies to K-Nearest Neighbors
You are using k-means clustering to classify heart patients for a hospital. You have chosen Patient Sex, Height, Weight, Age and Income as measures and have used 3 clusters. When you create a pair-wise plot of the clusters, you notice that there is significant overlap between the clusters. What should you do?
Select the correct option which applies to L2 regularization
Feature Hashing approach is "SGD-based classifiers avoid the need to predetermine vector size by simply picking a reasonable size and shoehorning the training data into vectors of that size" now with large vectors or with multiple locations per feature in Feature hashing?
Which of the following are point estimation methods?
Suppose there are three events then which formula must always be equal to P(E1|E2,E3)?
Digit recognition, is an example of.....
PDF + Testing Engine
|
---|
$66 |
Testing Engine
|
---|
$50 |
PDF (Q&A)
|
---|
$42 |
Databricks Free Exams |
---|
![]() |