Creating a dummy from a two-category variable
You are given a basetable with one predictive variable, "gender". Make sure that "gender" can be used as a predictive variable in a logistic regression model by creating dummy variables for it.
This exercise is part of the course
Intermediate Predictive Analytics in Python
Exercise instructions
- Create a pandas dataframe dummies_gender that has the dummy variables for "gender". Make sure to avoid multicollinearity.
- Add these dummies to the original basetable.
- Remove the original variable "gender" from the basetable.
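The multicollinearity warning in the first step can be illustrated with a small sketch. The dataframe `toy` and its values are made up for illustration and are not the exercise's basetable. With one dummy per category, the dummy columns always sum to 1 in every row, so they are perfectly collinear with a model intercept; passing `drop_first=True` to `pd.get_dummies` drops one category and removes that redundancy.

```python
import pandas as pd

# Hypothetical toy data, not the exercise's basetable
toy = pd.DataFrame({"gender": ["M", "F", "F", "M"]})

# One dummy column per category: the columns are redundant
full = pd.get_dummies(toy["gender"])

# Dropping the first category keeps only the information that is needed
reduced = pd.get_dummies(toy["gender"], drop_first=True)

# Cast to int so the check works whether dummies are bool or uint8
row_sums = full.astype(int).sum(axis=1)

print(full.columns.tolist())     # dummy columns without drop_first
print(reduced.columns.tolist())  # dummy columns with drop_first
print(row_sums.tolist())         # every row sums to 1 in the full encoding
```

For a two-category variable, a single dummy column carries all the information: a row that is not in the remaining category must be in the dropped one.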
Interactive exercise
Complete the sample code to successfully finish this exercise.
# Create the dummy variable
dummies_gender = pd.____(____["____"], drop_first=____)
# Add the dummy variable to the basetable
basetable = pd.concat([____, ____], axis=1)
# Delete the original variable from the basetable
del basetable["____"]
print(basetable.head())
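As a rough end-to-end sketch of the same three steps (create the dummy, concatenate, delete the original column), here is a version that runs on a made-up table. The name `toy_table`, the extra "age" column, and all values are assumptions for illustration; the real basetable in the exercise will differ.

```python
import pandas as pd

# Hypothetical stand-in for the basetable, with one extra column
toy_table = pd.DataFrame({
    "gender": ["M", "F", "F", "M"],
    "age": [34, 51, 27, 45],
})

# Step 1: create the dummy, dropping the first category
toy_dummies = pd.get_dummies(toy_table["gender"], drop_first=True)

# Step 2: append the dummy column(s) side by side (axis=1 means columns)
toy_table = pd.concat([toy_table, toy_dummies], axis=1)

# Step 3: remove the original categorical column
del toy_table["gender"]

print(toy_table.head())
```

Because `pd.concat` with `axis=1` aligns rows on the index, this approach assumes the dummies still share the index of the table they were created from, which holds here since no rows were filtered or reordered in between.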