Log normalization in Python
Now that we know that the Proline
column in our wine
dataset has a large amount of variance, let's log normalize it.
numpy
has been imported as np
.
This exercise is part of the course
Preprocessing for Machine Learning in Python
Exercise instructions
- Print out the variance of the
Proline
column for reference. - Use the
np.log()
function on theProline
column to create a new, log-normalized column namedProline_log
. - Print out the variance of the
Proline_log
column to see the difference.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Print out the variance of the Proline column
print(____)
# Apply the log normalization function to the Proline column
wine[____] = np.____(____)
# Check the variance of the normalized Proline column
print(____)