1. Learn
  2. /
  3. Courses
  4. /
  5. Practicing Statistics Interview Questions in R

Exercise

Classification evaluation

In the previous lesson, you have built a logistic model to predict Parkinson's disease. In this exercise, you will compare predictions against actual values.

This skill is vital because companies focus on the results. Employers want to know how accurate the models that you develop are.

Once again, you will build a model to predict the status of Parkinson's disease. This time, you will build the model on part of the dataset and use the rest for testing.

Remember that \(\text{recall} = \frac{TP}{TP+FN}\).

\(TP\) means that we have predicted a positive value and we were right.

\(FN\) means that we have predicted a negative value but we were wrong.

Around 80% of the rows of the parkinsons dataset have been assigned to train, and the rest have been assigned to test.

Instructions 1/3

undefined XP
    1
    2
    3
  • Build a logistic model that explains status by NHR and DFA using the train data.
  • Compute the probabilities of having Parkinson's disease for the test data.