Aan de slagGa gratis aan de slag

Encoding categorical variables - binary

Take a look at the hiking dataset. There are several columns here that need encoding before they can be modeled, one of which is the Accessible column. Accessible is a binary feature, so it has two values, Y or N, which need to be encoded into 1's and 0's. Use scikit-learn's LabelEncoder method to perform this transformation.

Deze oefening maakt deel uit van de cursus

Preprocessing for Machine Learning in Python

Cursus bekijken

Oefeninstructies

  • Store LabelEncoder() in a variable named enc.
  • Using the encoder's .fit_transform() method, encode the hiking dataset's "Accessible" column. Call the new column Accessible_enc.
  • Compare the two columns side-by-side to see the encoding.

Praktische interactieve oefening

Probeer deze oefening eens door deze voorbeeldcode in te vullen.

# Set up the LabelEncoder object
enc = ____

# Apply the encoding to the "Accessible" column
____ = ____.____(____)

# Compare the two columns
print(____[[____, ____]].head())
Code bewerken en uitvoeren