Feature engineering
You are tasked with predicting whether or not a new cohort of loan applicants are likely to default on their loans. You have a historical dataset and wish to train a classifier on it. You notice that many features are in string format, which is a problem for your classifiers. You hence decide to encode the string columns numerically using LabelEncoder()
. The function has been preloaded for you from the preprocessing
submodule of sklearn
. The dataset credit
is also preloaded, as is a list of all column names whose data types are string, stored in non_numeric_columns
.
Este ejercicio forma parte del curso
Designing Machine Learning Workflows in Python
Ejercicio interactivo práctico
Prueba este ejercicio completando el código de muestra.
# Inspect the first few lines of your data using head()
credit.____