Carrier

In this exercise you'll create a StringIndexer and a OneHotEncoder to encode the carrier column. To do this, call each class constructor with the arguments inputCol and outputCol.

The inputCol is the name of the column you want to index or encode, and the outputCol is the name of the new column that the Transformer should create.
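To make the two steps concrete, here is a minimal pure-Python sketch of what indexing and one-hot encoding do to a column of strings. It is an illustration only, not Spark's implementation: the helper names `fit_string_index` and `one_hot` and the sample carrier codes are made up for this example. It assumes the default behaviors of the Spark classes, namely that StringIndexer gives the most frequent label index 0 and that OneHotEncoder drops the last category, and it uses plain lists where Spark would return a sparse vector.

```python
from collections import Counter


def fit_string_index(values):
    """Map each distinct string to an integer, most frequent label first."""
    counts = Counter(values)
    # Sort by descending frequency; break ties alphabetically for determinism.
    ordered = sorted(counts, key=lambda label: (-counts[label], label))
    return {label: idx for idx, label in enumerate(ordered)}


def one_hot(index, num_categories, drop_last=True):
    """Turn an integer index into a 0/1 list, optionally dropping the last slot."""
    size = num_categories - 1 if drop_last else num_categories
    vector = [0.0] * size
    if index < size:
        vector[index] = 1.0  # the dropped category stays all zeros
    return vector


# Hypothetical sample values standing in for the carrier column.
carriers = ["AA", "UA", "AA", "DL", "UA", "AA"]

mapping = fit_string_index(carriers)  # plays the role of carrier -> carrier_index
encoded = [one_hot(mapping[c], len(mapping)) for c in carriers]  # carrier_index -> carrier_fact

print(mapping)
print(encoded[:4])
```

The first step corresponds to the StringIndexer (string to integer index) and the second to the OneHotEncoder (integer index to indicator vector), which is why the encoder's inputCol is the indexer's outputCol.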

This exercise is part of the course

Foundations of PySpark

Exercise instructions

  • Create a StringIndexer called carr_indexer by calling StringIndexer() with inputCol="carrier" and outputCol="carrier_index".
  • Create a OneHotEncoder called carr_encoder by calling OneHotEncoder() with inputCol="carrier_index" and outputCol="carrier_fact".

Hands-on interactive exercise

Try this exercise by completing the sample code.

# Create a StringIndexer
carr_indexer = StringIndexer(____)

# Create a OneHotEncoder
carr_encoder = OneHotEncoder(____)