Bayesian hyperparameter tuning with Hyperopt

In this example you will set up and run a Bayesian hyperparameter optimization process using the Hyperopt package (already imported as hp). You will set up the domain (similar to setting up the grid for a grid search), then define the objective function. Finally, you will run the optimizer for 20 iterations.

You will need to set up the domain using these values:

max_depth using a quniform distribution (between 2 and 10, in steps of 2)
learning_rate using a uniform distribution (from 0.001 to 0.9)
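To make the two distributions concrete, here is a minimal standard-library sketch of what they sample. It is not Hyperopt code: sample_quniform is a hypothetical helper that mirrors the documented quniform rule, round(uniform(low, high) / q) * q.

```python
import random


def sample_quniform(rng, low, high, q):
    """Hypothetical helper mirroring the quniform rule: round(uniform(low, high) / q) * q."""
    return round(rng.uniform(low, high) / q) * q


rng = random.Random(42)

# max_depth: quantized uniform between 2 and 10 in steps of 2
depths = [sample_quniform(rng, 2, 10, 2) for _ in range(1000)]

# learning_rate: plain continuous uniform between 0.001 and 0.9
rates = [rng.uniform(0.001, 0.9) for _ in range(1000)]

print(sorted(set(depths)))     # the distinct depth values that were drawn
print(min(rates), max(rates))  # both stay inside [0.001, 0.9]
```

Every sampled depth lands on the grid 2, 4, 6, 8, 10, with the two endpoints about half as likely as the interior values because only half a rounding interval maps to each of them. Hyperopt itself returns quniform values as floats (for example 4.0), which is why the objective function in this exercise casts max_depth with int().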

Note that, for this exercise, the process has been scaled down in data sample size and in the number of Hyperopt and GBM iterations. If you try this method on your own machine, use a larger search space, more trials, more cross-validation folds and a larger dataset to really see it in action!

This exercise is part of the course

Hyperparameter Tuning in Python

Exercise instructions

Set up a space dictionary using the domain described above.
Set up the objective function using a gradient boosting classifier.
Run the algorithm for 20 evaluations (simply use the default suggested algorithm from the slides).

Hands-on interactive exercise

Try to solve this exercise by completing the sample code.

# Set up space dictionary with specified hyperparameters
space = {
    'max_depth': hp.____('max_depth', ____, ____, ____),
    'learning_rate': hp.____('learning_rate', ____, ____)
}

# Set up objective function
def objective(params):
    params = {'max_depth': int(params[____]), 'learning_rate': params[____]}
    gbm_clf = ____(n_estimators=100, **params)
    best_score = cross_val_score(gbm_clf, X_train, y_train, scoring='accuracy', cv=2, n_jobs=4).mean()
    loss = 1 - ____
    return ____

# Run the algorithm
best = fmin(fn=____, space=space, max_evals=____, rstate=np.random.default_rng(42), algo=tpe.suggest)
print(____)


In this introductory chapter you will learn the difference between hyperparameters and parameters. You will practice extracting and analyzing parameters and setting hyperparameter values for several popular machine learning algorithms. Along the way you will learn some best-practice tips and tricks for choosing which hyperparameters to tune and what values to set, and you will build learning curves to analyze your hyperparameter choices.

Exercise 1: Introduction & 'Parameters'
Exercise 2: Parameters in Logistic Regression
Exercise 3: Extracting a Logistic Regression parameter
Exercise 4: Extracting a Random Forest parameter
Exercise 5: Introducing Hyperparameters
Exercise 6: Hyperparameters in Random Forests
Exercise 7: Exploring Random Forest Hyperparameters
Exercise 8: Hyperparameters of KNN
Exercise 9: Setting & Analyzing Hyperparameter Values
Exercise 10: Automating Hyperparameter Choice
Exercise 11: Building Learning Curves

This chapter introduces you to a popular automated hyperparameter tuning methodology called Grid Search. You will learn what it is, how it works and practice undertaking a Grid Search using Scikit Learn. You will then learn how to analyze the output of a Grid Search & gain practical experience doing this.

Exercise 1: Introducing Grid Search
Exercise 2: Build Grid Search functions
Exercise 3: Iteratively tune multiple hyperparameters
Exercise 4: How Many Models?
Exercise 5: Grid Search with Scikit Learn
Exercise 6: GridSearchCV inputs
Exercise 7: GridSearchCV with Scikit Learn
Exercise 8: Understanding a grid search output
Exercise 9: Using the best outputs
Exercise 10: Exploring the grid search results
Exercise 11: Analyzing the best results
Exercise 12: Using the best results

In this chapter you will be introduced to another popular automated hyperparameter tuning methodology called Random Search. You will learn what it is, how it works and importantly how it differs from grid search. You will learn some advantages and disadvantages of this method and when to choose this method compared to Grid Search. You will practice undertaking a Random Search with Scikit Learn as well as visualizing & interpreting the output.

Exercise 1: Introducing Random Search
Exercise 2: Randomly Sample Hyperparameters
Exercise 3: Randomly Search with Random Forest
Exercise 4: Visualizing a Random Search
Exercise 5: Random Search in Scikit Learn
Exercise 6: RandomSearchCV inputs
Exercise 7: The RandomizedSearchCV Object
Exercise 8: RandomSearchCV in Scikit Learn
Exercise 9: Comparing Grid and Random Search
Exercise 10: Comparing Random & Grid Search
Exercise 11: Grid and Random Search Side by Side

In this final chapter you will be given a taste of more advanced hyperparameter tuning methodologies known as "informed search". These include a methodology known as Coarse to Fine as well as Bayesian and genetic hyperparameter tuning algorithms. You will learn how informed search differs from uninformed search and gain practical skills with each of these methodologies, comparing and contrasting them as you go.

Exercise 1: Informed Search: Coarse to Fine
Exercise 2: Visualizing Coarse to Fine
Exercise 3: Coarse to Fine Iterations
Exercise 4: Informed Search: Bayesian Statistics
Exercise 5: Bayes' Rule in Python
Exercise 6: Bayesian hyperparameter tuning with Hyperopt (current exercise)
Exercise 7: Informed Search: Genetic Algorithms
Exercise 8: Genetic hyperparameter tuning with TPOT
Exercise 9: Analyzing TPOT's stability
Exercise 10: Congratulations!