Scaling data - standardizing columns
Since we know that the Ash, Alcalinity of ash, and Magnesium columns in the wine dataset are all on different scales, let's standardize them in a way that allows for use in a linear model.
Deze oefening maakt deel uit van de cursus
Preprocessing for Machine Learning in Python
Oefeninstructies
- Import the
StandardScalerclass. - Instantiate a
StandardScaler()and store it in the variable,scaler. - Create a subset of the
wineDataFrame containing theAsh,Alcalinity of ash, andMagnesiumcolumns, assign it towine_subset. - Fit and transform the standard scaler to
wine_subset.
Praktische interactieve oefening
Probeer deze oefening eens door deze voorbeeldcode in te vullen.
# Import StandardScaler
from sklearn.preprocessing import ____
# Create the scaler
scaler = ____
# Subset the DataFrame you want to scale
____ = wine[[____]]
# Apply the scaler to wine_subset
wine_subset_scaled = scaler.____(____)