Encoding shirt sizes
You have data for a consignment of t-shirts. The data includes the size of the shirt, which is given as either S, M, L or XL.
Here are the counts for the different sizes:
+----+-----+
|size|count|
+----+-----+
| S| 8|
| M| 15|
| L| 20|
| XL| 7|
+----+-----+
The sizes are first converted to an index using StringIndexer and then one-hot encoded using OneHotEncoder.
Which of the following is not true:
Deze oefening maakt deel uit van de cursus
Machine Learning with PySpark
Praktische interactieve oefening
Zet theorie om in actie met een van onze interactieve oefeningen.
Begin met trainen