1
Introduction to CTR and Basic Techniques
Free
Chances are you’re on this page because you clicked a link. In this chapter, you’ll learn why click-through-rates (CTR) are integral to targeted advertising, how to perform basic DataFrame manipulation, and how you can use machine learning models to predict CTR.
2
Exploratory CTR Data Analysis
This chapter provides the foundations for exploratory data analysis (EDA). Using sample data you’ll use the pandas library to look at columns and data types, explore missing data, and use hashing to perform feature engineering on categorical features. All of which are important when exploring features for more accurate CTR prediction.
3
Model Applications and Improvements
It’s time to dive deeper. Find out how you can use measures of model performance including precision and recall to answer real-world questions, such as evaluating ROI on ad spend. You’ll also learn ways to improve upon those evaluation metrics, such as ensemble methods and hyperparameter tuning.
4
Deep Learning
Profits can be heavily impacted by your campaign’s CTR. In this chapter, you’ll learn how deep learning can be used to reduce that risk. You’ll focus on multi-layer perceptron (MLP) and neural network models, and learn how these can be used to capture the complex relationship between variables to more accurately predict CTR. Lastly, you’ll explore how to apply the basics of hyperparameter tuning and regularization to classification models.

Initializing

Log normalization

Standardization is important to make sure all features are comparable. Log normalization is a common method of standardization. You will check the variance of select features and compute the overall median variance among features. The features will be the numeric ones, except for the click column, banner_pos, device_type, and columns search_engine_type, product_type, advertiser_type from last lesson since they are actually categorical columns. Then you will apply log normalization to these columns with a variance higher than the median variance and check results.

The pandas module is available as pd in your workspace and the sample DataFrame is loaded as df.

Select the numeric columns and exclude particular columns using .isin() on filter_cols.
Print out the variance of each column using .var() (don't forget the parentheses!).
Calculate and print the median variance using .median().