Examining tc, ldl, and hdl

The diabetes dataset, dia, will be used as the real-world example for both this chapter and the next. Proper data exploration is a foundation for performing effective Monte Carlo simulations, so you'll continue exploring the data in the exercises!

In this exercise, you'll focus on three variables: tc, ldl, and hdl. The dia DataFrame has been loaded for you.

The following libraries have been imported for you: pandas as pd, numpy as np, matplotlib.pyplot as plt, and seaborn as sns.

This exercise is part of the course

Monte Carlo Simulations in Python

Exercise instructions

  • Use the pairplot() function in seaborn to visually examine the relationship between the columns tc, ldl, and hdl in dia (specified in that order).
  • Use the .corr() method from pandas to measure the correlation coefficients between tc, ldl, and hdl in dia (specified in that order).
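As a rough illustration of the second instruction, the sketch below runs .corr() on a small synthetic stand-in for dia. The DataFrame dia_demo and all of its values are made up for this example and are not the real diabetes data. With no arguments, .corr() computes pairwise Pearson coefficients, and the rows and columns of the result follow the order of the selected columns.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in for dia (hypothetical values, NOT the real dataset)
rng = np.random.default_rng(0)
ldl = rng.normal(115, 30, size=200)
hdl = rng.normal(50, 12, size=200)
tc = ldl + hdl + rng.normal(25, 10, size=200)  # built to correlate with ldl and hdl
dia_demo = pd.DataFrame({"hdl": hdl, "tc": tc, "ldl": ldl})

# A list of names inside the outer brackets selects columns in the listed order
subset = dia_demo[["tc", "ldl", "hdl"]]

# Pairwise Pearson correlation coefficients (the pandas default method)
corr_matrix = subset.corr()
print(corr_matrix.round(2))
```

The diagonal of the matrix is always 1, since every column is perfectly correlated with itself, and the matrix is symmetric, so each pair only needs to be read once.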

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Create a pairplot of tc, ldl, and hdl
____(dia[[____]])
plt.show()

# Calculate correlation coefficients
print(____)