Full evaluation

Recall that precision and recall can be weighted differently, which makes the F-beta score an important evaluation metric. The ROC curve and its AUC are essential complements to precision and recall: as you saw earlier, a model can have a high AUC but low precision. In this exercise, you will calculate the full set of evaluation metrics for each classifier.
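As a reminder of how the weighting works, here is a minimal sketch of the F-beta formula computed from confusion-matrix counts. The counts are made up for illustration, and the helper name fbeta_from_counts is hypothetical (it is not part of sklearn or the course). A beta below 1 favors precision and a beta above 1 favors recall.

```python
def fbeta_from_counts(tp, fp, fn, beta):
    """F-beta for the positive class from true/false positive and false negative counts.

    Assumes tp + fp > 0 and tp + fn > 0, so precision and recall are defined.
    """
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Made-up counts: a precise but low-recall classifier
tp, fp, fn = 20, 5, 60          # precision = 0.8, recall = 0.25

f_half = fbeta_from_counts(tp, fp, fn, beta=0.5)  # weights precision more
f_one = fbeta_from_counts(tp, fp, fn, beta=1.0)   # the usual F1 score
f_two = fbeta_from_counts(tp, fp, fn, beta=2.0)   # weights recall more

print(round(f_half, 4), round(f_one, 4), round(f_two, 4))
```

Because this classifier is strong on precision and weak on recall, the score drops as beta grows: the same predictions look better or worse depending on which error type you decide matters more.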

A function print_estimator_name() is provided that returns the name of each classifier. X_train, y_train, X_test, and y_test are available in your workspace, and the features have already been standardized. pandas (as pd) and sklearn are also available in your workspace.
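The course does not show how print_estimator_name() is implemented. If you want to run the exercise outside the course workspace, one plausible stand-in (an assumption, not the course's actual code) simply returns the class name of the object it receives:

```python
def print_estimator_name(estimator):
    """Hypothetical stand-in: return the estimator's class name as a string."""
    return type(estimator).__name__


class StandInClassifier:
    """Placeholder class used only to demonstrate the helper without importing sklearn."""


name = print_estimator_name(StandInClassifier())
print("Evaluating classifier: %s" % name)
```

The same function works unchanged on sklearn estimators, for example returning 'LogisticRegression' for a LogisticRegression() instance.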

This exercise is part of the course

Predicting CTR with Machine Learning in Python

Exercise instructions

Define an MLP classifier with one hidden layer of 10 units and a maximum of 50 iterations.
Train and predict for each classifier.
Use sklearn's implementations to get the precision, recall, F-beta score, and AUC of the ROC curve.
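The sklearn metric functions take different inputs: precision_score, recall_score, and fbeta_score compare true labels with hard predicted labels, while roc_auc_score compares true labels with scores such as the positive-class probability. The sketch below uses a tiny hand-made dataset (an assumption for illustration, not the course data) and the default binary averaging rather than the exercise's average='weighted'. It also reproduces the earlier point that AUC can be high while precision is low.

```python
from sklearn.metrics import fbeta_score, precision_score, recall_score, roc_auc_score

# Hand-made example: 8 negatives followed by 2 positives
y_true = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1]
# Scores rank both positives above every negative
y_score = [0.10, 0.20, 0.30, 0.40, 0.55, 0.60, 0.65, 0.70, 0.80, 0.90]
# Hard labels from a 0.5 threshold: 6 predicted positives, only 2 are correct
y_pred = [1 if s >= 0.5 else 0 for s in y_score]

prec = precision_score(y_true, y_pred)          # uses hard labels
rec = recall_score(y_true, y_pred)              # uses hard labels
fbeta = fbeta_score(y_true, y_pred, beta=0.5)   # uses hard labels
roc_auc = roc_auc_score(y_true, y_score)        # uses scores, not labels

print("Precision: %.3f, Recall: %.3f, F-beta: %.3f, AUC: %.3f"
      % (prec, rec, fbeta, roc_auc))
```

Here the ranking is perfect, so the AUC is 1.0, yet only a third of the predicted clicks are real, which is why the exercise asks for all four metrics together.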

Hands-on interactive exercise

Try this exercise by completing the sample code.

# Create classifiers
clfs = [LogisticRegression(), DecisionTreeClassifier(), RandomForestClassifier(), 
        ____(____ = (10, ), ____ = 50)]

# Produce all evaluation metrics for each classifier
for clf in clfs:
  print("Evaluating classifier: %s" %(print_estimator_name(clf)))
  y_score = clf.fit(X_train, y_train).____(X_test)
  y_pred = clf.fit(X_train, y_train).____(X_test)
  prec = ____(y_test, y_pred, average = 'weighted')
  recall = ____(y_test, y_pred, average = 'weighted')
  fbeta = ____(y_test, y_pred, beta = 0.5, average = 'weighted')
  roc_auc = ____(y_test, y_score[:, 1])
  print("Precision: %s: Recall: %s, F-beta score: %s, AUC of ROC curve: %s" 
        %(prec, recall, fbeta, roc_auc))
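For reference, here is one way the template could be completed and run end to end. This is a sketch under stated assumptions, not the course's official solution: the data is a synthetic, imbalanced stand-in generated with make_classification, the class name replaces print_estimator_name(), random_state values are added for repeatability, and each model is fitted once and then reused for both predict_proba and predict (the template fits twice, which can give two slightly different models for randomized estimators).

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import fbeta_score, precision_score, recall_score, roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the course's click data (assumption): ~20% positives
X, y = make_classification(n_samples=500, n_features=10,
                           weights=[0.8, 0.2], random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0)

# Standardize features, fitting the scaler on the training split only
scaler = StandardScaler().fit(X_train)
X_train, X_test = scaler.transform(X_train), scaler.transform(X_test)

# Create classifiers; the MLP has one hidden layer of 10 units, 50 iterations max
clfs = [LogisticRegression(),
        DecisionTreeClassifier(random_state=0),
        RandomForestClassifier(random_state=0),
        MLPClassifier(hidden_layer_sizes=(10,), max_iter=50, random_state=0)]

results = {}
for clf in clfs:
    name = type(clf).__name__
    clf.fit(X_train, y_train)             # fit once, reuse for both outputs
    y_score = clf.predict_proba(X_test)   # class probabilities, shape (n, 2)
    y_pred = clf.predict(X_test)          # hard labels
    results[name] = {
        "precision": precision_score(y_test, y_pred, average="weighted",
                                     zero_division=0),
        "recall": recall_score(y_test, y_pred, average="weighted"),
        "fbeta": fbeta_score(y_test, y_pred, beta=0.5, average="weighted",
                             zero_division=0),
        "roc_auc": roc_auc_score(y_test, y_score[:, 1]),  # positive-class column
    }
    print("Evaluating classifier: %s" % name)
    print("Precision: %(precision).3f, Recall: %(recall).3f, "
          "F-beta score: %(fbeta).3f, AUC of ROC curve: %(roc_auc).3f"
          % results[name])
```

With only 50 iterations the MLP may emit a ConvergenceWarning; that is expected for this setting and does not stop the loop.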

Chances are you’re on this page because you clicked a link. In this chapter, you’ll learn why click-through rates (CTR) are integral to targeted advertising, how to perform basic DataFrame manipulation, and how you can use machine learning models to predict CTR.

Exercise 1: Introduction to click-through rates Exercise 2: Beginning steps Exercise 3: Feature exploration Exercise 4: First evaluation of data Exercise 5: Overview of machine learning models Exercise 6: Logistic regression for breast cancer Exercise 7: Logistic regression for images Exercise 8: A second toy model Exercise 9: CTR prediction using decision trees Exercise 10: Model implementation Exercise 11: A first CTR model Exercise 12: Beyond only accuracy

This chapter provides the foundations for exploratory data analysis (EDA). Using sample data, you’ll use the pandas library to look at columns and data types, explore missing data, and use hashing to perform feature engineering on categorical features, all of which matter when exploring features for more accurate CTR prediction.

Exercise 1: Exploratory data analysis Exercise 2: A first look Exercise 3: Checking for missing values Exercise 4: Distributions by CTR Exercise 5: Feature engineering Exercise 6: Analyzing datetime columns Exercise 7: Converting categorical variables Exercise 8: Creating new features Exercise 9: Standardizing features Exercise 10: Log normalization Exercise 11: Understanding standardization Exercise 12: Standard scaling

It’s time to dive deeper. Find out how you can use measures of model performance including precision and recall to answer real-world questions, such as evaluating ROI on ad spend. You’ll also learn ways to improve upon those evaluation metrics, such as ensemble methods and hyperparameter tuning.

Exercise 1: Applications of metric evaluation Exercise 2: Four categories of outcomes Exercise 3: Evaluating four categories Exercise 4: ROI on ad spend Exercise 5: Model evaluation Exercise 6: Precision and recall Exercise 7: Baseline Exercise 8: Classifier comparison Exercise 9: Tuning models Exercise 10: Regularization Exercise 11: Cross validation Exercise 12: Model selection Exercise 13: Ensembles and hyperparameter tuning Exercise 14: Understanding hyperparameter tuning Exercise 15: Random forests Exercise 16: Grid search

Profits can be heavily impacted by your campaign’s CTR. In this chapter, you’ll learn how deep learning can be used to reduce that risk. You’ll focus on multi-layer perceptron (MLP) and neural network models, and learn how these can be used to capture the complex relationship between variables to more accurately predict CTR. Lastly, you’ll explore how to apply the basics of hyperparameter tuning and regularization to classification models.

Exercise 1: Introduction to deep learning Exercise 2: Understanding MLPs Exercise 3: Initial model Exercise 4: MLPs for CTR Exercise 5: Hyperparameter tuning in deep learning Exercise 6: Hyperparameter tuning in MLPs Exercise 7: Varying hyperparameters Exercise 8: Grid search for MLPs Exercise 9: Model evaluation Exercise 10: F-beta score Exercise 11: Low precision and high AUC Exercise 12: Precision, ROI, and AUC Exercise 13: Model review and comparison Exercise 14: Model comparison warm-up Exercise 15: Evaluating precision and ROI Exercise 16: Full evaluation (current exercise) Exercise 17: Wrap-up video