Home > Python > Quizzes > Applied Datamining with Python
Applied Datamining with Python
Fast practice, instant feedback. Timer auto-submits when time’s up.
Avg score: 50% Most missed: “Given the time series object Calendar, which option shows multiple techniques fo…”

For Pluralsight Skill Assessments.

Applied Datamining with Python
Time left 00:00
18 Questions

1. What is a bag of words model?
2. What is a concern you must watch for while tuning the parameters of a decision tree classifier?
3. You are creating a corpus on which a classification model will be trained for sentiment analysis. After preprocessing the data and generating word frequency -you found 'i' present in the highest frequency. What step did you miss
4. Which option would build and train a basic Naive Bayes model that is used for classification and follows a normal distribution?
5. What is a bag of words model?
6. How does collinearity between features affect the regression?
7. Given the time series object Calendar, which option shows multiple techniques for selecting the value of index 1991-05-28 ?

Date|Cars Sold
--|---
20180101| 50
19919529 |25
19920630 |12
19910528 |29
8. You are creating a corpus on which a classification model will be trained for sentiment analysis. After preprocessing the data and generating word frequency -you found 'i' present in the highest frequency. What step did you miss
9. What is true of a directed network?
10. What is a bag of words model?
11. What is a bag of words model?
12. What is Survival Analysis?
13. Which of the following is NT a common metric for measuring regression models?
14. What is true of a directed network?
15. You are trying to find the monthly cost associated with raising children at a certain age. Given Y (monthly cost) and X (years old), you estimate the coefficients and plot a linear regression line. You observe that at three-years- old, a child should cost their parents $380 per month. What is the error value of the calculation
python
X (years old) 1 2 3 4
Y (monthly cost) 200 300 350 500
16. Given the time series object Calendar, which option shows multiple techniques for selecting the value of index 1991-05-28 ?

Date|Cars Sold
--|---
20180101| 50
19919529 |25
19920630 |12
19910528 |29
17. You built a link prediction model which resulted into a True Positive score of 12, a True Negative score of 10, a False Positive score of 36, and a False -Negative score of 18. What is the precision of the link prediction model
18. You are using the lifelines library to implement the Kaplan-Meier method on a dataset containing time, event, and color as variables. After fitting the model how would you plot the estimate?