Fit the model(s)
You're finally ready to fit the models and select the best one!
Unfortunately, cross validation is a very computationally intensive procedure. Fitting all the models would take too long on DataCamp.
To do this locally you would use the code:
# Fit cross validation models
models = cv.fit(training)
# Extract the best model
best_lr = models.bestModel
Remember, the training data is called training
and you're using lr
to fit a logistic regression model. Cross validation selected the parameter values regParam=0
and elasticNetParam=0
as being the best. These are the default values, so you don't need to do anything else with lr
before fitting the model.
This exercise is part of the course
Foundations of PySpark
Exercise instructions
- Create
best_lr
by callinglr.fit()
on thetraining
data. - Print
best_lr
to verify that it's an object of theLogisticRegressionModel
class.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Call lr.fit()
best_lr = ____
# Print best_lr
print(____)