CommencerCommencer gratuitement

Encoding categorical variables - binary

Take a look at the hiking dataset. There are several columns here that need encoding before they can be modeled, one of which is the Accessible column. Accessible is a binary feature, so it has two values, Y or N, which need to be encoded into 1's and 0's. Use scikit-learn's LabelEncoder method to perform this transformation.

Cet exercice fait partie du cours

Preprocessing for Machine Learning in Python

Afficher le cours

Instructions

  • Store LabelEncoder() in a variable named enc.
  • Using the encoder's .fit_transform() method, encode the hiking dataset's "Accessible" column. Call the new column Accessible_enc.
  • Compare the two columns side-by-side to see the encoding.

Exercice interactif pratique

Essayez cet exercice en complétant cet exemple de code.

# Set up the LabelEncoder object
enc = ____

# Apply the encoding to the "Accessible" column
____ = ____.____(____)

# Compare the two columns
print(____[[____, ____]].head())
Modifier et exécuter le code