RandomSearchCV no Scikit Learn

Vamos praticar a construção de um objeto RandomizedSearchCV usando Scikit Learn.

A grade de hiperparâmetros deve incluir max_depth (todos os valores entre 5 e 25, inclusive) e max_features ('auto' e 'sqrt').

As opções desejadas para o objeto RandomizedSearchCV são:

Um Estimator RandomForestClassifier com n_estimators igual a 80.
Validação cruzada com 3 folds (cv)
Usar roc_auc para pontuar os modelos
Usar 4 núcleos para processamento em paralelo (n_jobs)
Garantir que você faça refit do melhor modelo e retorne as pontuações de treinamento
Amostrar apenas 5 modelos por eficiência (n_iter)

Os conjuntos de dados X_train e y_train já estão carregados para você.

Lembre-se: para extrair os hiperparâmetros escolhidos, eles estão em cv_results_, com uma coluna por hiperparâmetro. Por exemplo, a coluna do hiperparâmetro criterion seria param_criterion.

Este exercicio faz parte do curso

Ajuste de Hiperparâmetros em Python

Instruções do exercicio

Crie uma grade de hiperparâmetros conforme especificado no contexto acima.
Crie um objeto RandomizedSearchCV conforme descrito no contexto acima.
Ajuste o objeto RandomizedSearchCV aos dados de treinamento.
Faça indexação em cv_results_ para imprimir os valores escolhidos pelo processo de modelagem para ambos os hiperparâmetros (max_depth e max_features).

exercicio interativo prático

Tente este exercicio completando este código de exemplo.

# Create the parameter grid
param_grid = {'max_depth': list(range(____,26)), 'max_features': [____ , ____]} 

# Create a random search object
random_rf_class = RandomizedSearchCV(
    estimator = ____(n_estimators=____),
    param_distributions = ____, n_iter = ____,
    scoring=____, n_jobs=____, cv = ____, refit=____, return_train_score = ____ )

# Fit to the training data
____.fit(X_train, y_train)

# Print the values used for both hyperparameters
print(random_rf_class.cv_results_[____])
print(random_rf_class.cv_results_[____])

Editar e Executar Código

Este exercicio faz parte do curso

Ajuste de Hiperparâmetros em Python

IntermediárioNível de habilidade

4.9+

Comece o curso gratuitamente

In this introductory chapter you will learn the difference between hyperparameters and parameters. You will practice extracting and analyzing parameters, setting hyperparameter values for several popular machine learning algorithms. Along the way you will learn some best practice tips & tricks for choosing which hyperparameters to tune and what values to set & build learning curves to analyze your hyperparameter choices.

Exercise 1: Introduction & 'Parameters'Exercise 2: Parameters in Logistic Regression Exercise 3: Extracting a Logistic Regression parameter Exercise 4: Extracting a Random Forest parameter Exercise 5: Introducing Hyperparameters Exercise 6: Hyperparameters in Random Forests Exercise 7: Exploring Random Forest Hyperparameters Exercise 8: Hyperparameters of KNN Exercise 9: Setting & Analyzing Hyperparameter Values Exercise 10: Automating Hyperparameter Choice Exercise 11: Building Learning Curves

This chapter introduces you to a popular automated hyperparameter tuning methodology called Grid Search. You will learn what it is, how it works and practice undertaking a Grid Search using Scikit Learn. You will then learn how to analyze the output of a Grid Search & gain practical experience doing this.

Exercise 1: Introducing Grid Search Exercise 2: Build Grid Search functions Exercise 3: Iteratively tune multiple hyperparameters Exercise 4: How Many Models?Exercise 5: Grid Search with Scikit Learn Exercise 6: GridSearchCV inputs Exercise 7: GridSearchCV with Scikit Learn Exercise 8: Understanding a grid search output Exercise 9: Using the best outputs Exercise 10: Exploring the grid search results Exercise 11: Analyzing the best results Exercise 12: Using the best results

In this chapter you will be introduced to another popular automated hyperparameter tuning methodology called Random Search. You will learn what it is, how it works and importantly how it differs from grid search. You will learn some advantages and disadvantages of this method and when to choose this method compared to Grid Search. You will practice undertaking a Random Search with Scikit Learn as well as visualizing & interpreting the output.

Exercise 1: Introdução ao Random Search Exercise 2: Amostre hiperparâmetros aleatoriamente Exercise 3: Busca aleatória com Random Forest Exercise 4: Visualizando um Random Search Exercise 5: Random Search no Scikit Learn Exercise 6: Entradas do RandomSearchCV Exercise 7: O objeto RandomizedSearchCV Exercise 8: RandomSearchCV no Scikit Learn

Exercicio Atual

Exercise 9: Comparando Grid Search e Random Search Exercise 10: Comparando Random Search e Grid Search Exercise 11: Grid e Random Search lado a lado

In this final chapter you will be given a taste of more advanced hyperparameter tuning methodologies known as ''informed search''. This includes a methodology known as Coarse To Fine as well as Bayesian & Genetic hyperparameter tuning algorithms. You will learn how informed search differs from uninformed search and gain practical skills with each of the mentioned methodologies, comparing and contrasting them as you go.

Exercise 1: Informed Search: Coarse to Fine Exercise 2: Visualizing Coarse to Fine Exercise 3: Coarse to Fine Iterations Exercise 4: Informed Search: Bayesian Statistics Exercise 5: Bayes Rule in Python Exercise 6: Bayesian Hyperparameter tuning with Hyperopt Exercise 7: Informed Search: Genetic Algorithms Exercise 8: Genetic Hyperparameter Tuning with TPOT Exercise 9: Analysing TPOT's stability Exercise 10: Congratulations!