Encoding shirt sizes
You have data for a consignment of t-shirts. The data includes the size of the shirt, which is given as either S, M, L or XL.
Here are the counts for the different sizes:
+----+-----+
|size|count|
+----+-----+
| S| 8|
| M| 15|
| L| 20|
| XL| 7|
+----+-----+
The sizes are first converted to an index using StringIndexer and then one-hot encoded using OneHotEncoder.
Which of the following is not true:
Questo esercizio fa parte del corso
Machine Learning with PySpark
Esercizio pratico interattivo
Passa dalla teoria alla pratica con uno dei nostri esercizi interattivi
Inizia esercizio