Scaling data - standardizing columns
Since we know that the Ash
, Alcalinity of ash
, and Magnesium
columns in the wine
dataset are all on different scales, let's standardize them in a way that allows for use in a linear model.
Diese Übung ist Teil des Kurses
Preprocessing for Machine Learning in Python
Anleitung zur Übung
- Import the
StandardScaler
class. - Instantiate a
StandardScaler()
and store it in the variable,scaler
. - Create a subset of the
wine
DataFrame containing theAsh
,Alcalinity of ash
, andMagnesium
columns, assign it towine_subset
. - Fit and transform the standard scaler to
wine_subset
.
Interaktive Übung
Versuche dich an dieser Übung, indem du diesen Beispielcode vervollständigst.
# Import StandardScaler
from sklearn.preprocessing import ____
# Create the scaler
scaler = ____
# Subset the DataFrame you want to scale
____ = wine[[____]]
# Apply the scaler to wine_subset
wine_subset_scaled = scaler.____(____)