Creating dummy variables
Being able to include categorical features in the model building process can enhance performance as they may add information that contributes to prediction accuracy.
The music_df
dataset has been preloaded for you, and its shape is printed. Also, pandas
has been imported as pd
.
Now you will create a new DataFrame containing the original columns of music_df
plus dummy variables from the "genre"
column.
Questo esercizio fa parte del corso
Supervised Learning with scikit-learn
Istruzioni dell'esercizio
- Use a relevant function, passing the entire
music_df
DataFrame, to createmusic_dummies
, dropping the first binary column. - Print the shape of
music_dummies
.
Esercizio pratico interattivo
Prova questo esercizio completando il codice di esempio.
# Create music_dummies
music_dummies = ____
# Print the new DataFrame's shape
print("Shape of music_dummies: {}".format(____))