A classification task has examples that are labeled as belonging to one of two classes:
•90% of the examples belong to class-1
•10% belong to class-2
Which two techniques are appropriate to deal with the class imbalance? (Choose two.)
What are three elements that are typically part of a machine learning pipeline in scikit-learn or pyspark? (Choose three.)
Which two statements are correct about deploying machine learning models? (Choose two.)
What is used to scale large positive values during data cleaning?
A data scientist is exploring transaction data from a chain of stores with several locations. The data includes store number, date of sale, and purchase amount.
If the data scientist wants to compare total monthly sales between stores, which two options would be good ways to aggregate the data? (Choose two.)
Which two properties hold true for standardized variables (also known as z-score normalization)? (Choose two.)
What are the various components that make up a time series data?
A neural network is trained for a classification task. During training, you monitor the loss function for the train dataset and the validation dataset, along with the accuracy for the validation dataset. The goal is to get an accuracy of 95%.
From the graph, what modification would be appropriate to improve the performance of the model?
Which is the most important thing to ensure while collecting data?
Which distance is applied for multivariate outlier detection?
PDF + Testing Engine
|
---|
$66 |
Testing Engine
|
---|
$50 |
PDF (Q&A)
|
---|
$42 |
IBM Free Exams |
---|
![]() |