Get Started

Calculating Information Value

So far you have combined data from multiple sources and created new variables to derive insights from data. Do you think all these variables can explain turnover?

Information Value (IV) helps in measuring and ranking the variables on the basis of the predictive power of each variable. You can use Information Value (IV) to drop the variables which have very low predictive power.

This is a part of the course

“HR Analytics: Predicting Employee Churn in R”

View Course

Exercise instructions

  • Load the Information package.
  • Use the emp_final dataset from the previous exercise to find information value of all the variables in the dataset.
  • Print Information Value (IV) of each variable.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Load Information package
___

# Compute Information Value 
IV <- create_infotables(data = ___, y = ___)

# Print Information Value 
___

This exercise is part of the course

HR Analytics: Predicting Employee Churn in R

IntermediateSkill Level
5.0+
1 reviews

Predict employee turnover and design retention strategies.

In this chapter, you will create new variables from existing data to explain employee turnover. You will analyze compensation data and create compa-ratio to measure pay equity of all employees. To identify the most important variables influencing turnover, you will use the concept of Information Value (IV).

Exercise 1: Feature engineeringExercise 2: Derive age differenceExercise 3: Derive job hop indexExercise 4: Derive employee tenureExercise 5: CompensationExercise 6: Exploring compensationExercise 7: Pay GapExercise 8: Deriving Compa-ratioExercise 9: Deriving Compa-levelExercise 10: Information valueExercise 11: Calculating Information Value
Exercise 12: Which variables are important?

What is DataCamp?

Learn the data skills you need online at your own pace—from non-coding essentials to data science and machine learning.

Start Learning for Free