Carrier
In this exercise you'll create a StringIndexer and a OneHotEncoder to encode the carrier column. To do this, you'll call the class constructors with the arguments inputCol and outputCol.
The inputCol is the name of the column you want to index or encode, and the outputCol is the name of the new column that the Transformer should create.
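To make the two steps concrete, here is a minimal plain-Python sketch of the idea behind indexing and then one-hot encoding a string column. It is not PySpark code. The helper names fit_string_index and one_hot and the sample carrier codes are hypothetical, made up for illustration. The sketch assumes the usual Spark defaults: the most frequent label gets index 0, and the last category is dropped from the encoded vector.

```python
from collections import Counter


def fit_string_index(values):
    """Map each distinct label to an integer index, most frequent label first."""
    counts = Counter(values)
    # Sort by descending frequency; break ties alphabetically (an assumption).
    ordered = sorted(counts, key=lambda label: (-counts[label], label))
    return {label: idx for idx, label in enumerate(ordered)}


def one_hot(index, num_categories, drop_last=True):
    """Return a 0/1 list for `index`; with drop_last, the last category is all zeros."""
    size = num_categories - 1 if drop_last else num_categories
    return [1 if i == index else 0 for i in range(size)]


# Hypothetical carrier codes, for illustration only.
carriers = ["AA", "UA", "AA", "DL", "UA", "AA"]

# Step 1: string -> index (the role of the "carrier_index" column).
mapping = fit_string_index(carriers)
carrier_index = [mapping[c] for c in carriers]

# Step 2: index -> one-hot vector (the role of the "carrier_fact" column).
carrier_fact = [one_hot(i, len(mapping)) for i in carrier_index]

print(mapping)
print(carrier_fact)
```

In Spark itself the encoded column holds sparse vectors rather than Python lists, and the index mapping is learned when the StringIndexer is fit to the data, so treat this only as a picture of what the inputCol and outputCol columns contain.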
This exercise is part of the Foundations of PySpark course.
Exercise instructions
- Create a StringIndexer called carr_indexer by calling StringIndexer() with inputCol="carrier" and outputCol="carrier_index".
- Create a OneHotEncoder called carr_encoder by calling OneHotEncoder() with inputCol="carrier_index" and outputCol="carrier_fact".
Hands-on interactive exercise
Complete this sample code to finish the exercise.
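# Import the feature transformers used below
from pyspark.ml.feature import StringIndexer, OneHotEncoder

# Create a StringIndexer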
carr_indexer = StringIndexer(____)
# Create a OneHotEncoder
carr_encoder = OneHotEncoder(____)