Using the best results

While it is interesting to analyze the results of our grid search, our ultimate goal is practical: we want to make predictions on our test set using our estimator object.

We can access this object through the best_estimator_ property of our grid search object.
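As a minimal sketch of where this property lives, the snippet below fits a small GridSearchCV on synthetic data and inspects best_estimator_. The dataset, the parameter grid, and the names grid_demo and best_model are assumptions made for illustration, not the course's actual setup. Note that best_estimator_ is only available when the search is run with refit enabled, which is scikit-learn's default.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

# Synthetic binary classification data (illustrative only)
X, y = make_classification(n_samples=200, n_features=8, random_state=42)

# Hypothetical small grid; the grid used in the course may differ
param_grid = {"max_depth": [2, 4], "n_estimators": [10, 20]}

grid_demo = GridSearchCV(
    estimator=RandomForestClassifier(random_state=42),
    param_grid=param_grid,
    scoring="roc_auc",
    cv=3,
    refit=True,  # the default; needed for best_estimator_ to exist
)
grid_demo.fit(X, y)

# The best estimator is a fitted copy of the estimator passed in,
# configured with the winning hyperparameter combination
best_model = grid_demo.best_estimator_
print(type(best_model))
print(grid_demo.best_params_)
```

Because the search refits the winning configuration on the full data passed to fit, best_model can be used for prediction directly, without any further training.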

Let's look inside the best_estimator_ property, make predictions, and generate evaluation scores. We first use the default predict (which returns class predictions), but then we need predict_proba rather than predict to compute the ROC-AUC score, because ROC-AUC is calculated from probability scores. We use the slice [:,1] to select the probabilities of the positive class.

You have the X_test and y_test datasets available, as well as the grid_rf_class object from previous exercises.
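To make the difference between predict and predict_proba concrete, here is a small sketch on synthetic data. It trains a plain RandomForestClassifier instead of going through a grid search, and the names X_test_demo, y_test_demo, and model are hypothetical stand-ins for the exercise's objects.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

# Synthetic binary data and a hold-out split (illustrative only)
X, y = make_classification(n_samples=300, n_features=8, random_state=0)
X_train, X_test_demo, y_train, y_test_demo = train_test_split(
    X, y, test_size=0.3, random_state=0
)

model = RandomForestClassifier(n_estimators=20, random_state=0)
model.fit(X_train, y_train)

# predict returns hard class labels (0 or 1 here)
class_preds = model.predict(X_test_demo)

# predict_proba returns one column per class, ordered as in model.classes_
proba = model.predict_proba(X_test_demo)

# Column 1 holds the probability of the positive class (label 1)
positive_proba = proba[:, 1]

print(class_preds[:5])
print(positive_proba[:5])
print("ROC-AUC:", roc_auc_score(y_test_demo, positive_proba))
```

Passing hard labels to roc_auc_score would still run, but it collapses the ranking to a single threshold, so the probability column is the usual input.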

This exercise is part of the course

Hyperparameter Tuning in Python


Exercise instructions

Check the type of the best_estimator_ property.
Use the best_estimator_ property to make predictions on our test set.
Generate a confusion matrix and ROC_AUC score from our predictions.
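For reading the confusion matrix in the last step, a tiny hand-made example may help. The label lists below are invented for illustration. In scikit-learn's confusion_matrix, rows are the true classes and columns are the predicted classes.

```python
from sklearn.metrics import confusion_matrix

# Invented labels, chosen so the counts are easy to verify by hand
y_true = [0, 0, 0, 1, 1, 1, 1, 0]
y_pred = [0, 1, 0, 1, 0, 1, 1, 0]

cm = confusion_matrix(y_true, y_pred)
print(cm)
# [[3 1]
#  [1 3]]

# For a binary problem, ravel() yields the counts in this order
tn, fp, fn, tp = cm.ravel()
print(tn, fp, fn, tp)  # 3 1 1 3
```

Here three of the four actual negatives and three of the four actual positives are classified correctly, with one false positive and one false negative.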

Hands-on interactive exercise

Try this exercise by completing the sample code below.

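# Metric functions used below (these may already be loaded in the exercise environment)
from sklearn.metrics import confusion_matrix, roc_auc_score

# See what type of object the best_estimator_ property is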
print(____(____.____))

# Create an array of predictions directly using the best_estimator_ property
predictions = grid_rf_class.____._____(X_test)

# Take a look to confirm it worked, this should be an array of 1's and 0's
print(predictions[0:5])

# Now create a confusion matrix 
print("Confusion Matrix \n", confusion_matrix(y_test, ______))

# Get the ROC-AUC score
predictions_proba = grid_rf_class.best_estimator_.predict_proba(X_test)[:,1]
print("ROC-AUC Score \n", roc_auc_score(y_test, _____))

