Separating Target and Features
In order to make a prediction (in this case, whether an employee would leave or not), one needs to separate the dataset into two components:
- the dependent variable or target which needs to be predicted
- the independent variables or features that will be used to make a prediction
Your task is to separate the target
and features
. The target you have here is the employee churn, and features include everything else.
Reminder: the dataset has already been modified by encoding categorical variables and getting dummies.
pandas
has been imported for you as pd
.
This exercise is part of the course
HR Analytics: Predicting Employee Churn in Python
Exercise instructions
- Set the target and features:
- Choose the dependent variable column (
churn
) and set it astarget
. .drop()
the columnchurn
to set everything else asfeatures
.
- Choose the dependent variable column (
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Set the target and features
# Choose the dependent variable column (churn) and set it as target
target = data.____
# Drop column churn and set everything else as features
features = data.____("____",axis=1)