Examining y, tc, and hdl

Now you'll focus on tc, hdl, and y to understand the relationship between two independent variables (tc and hdl) and the dependent variable, y, which measures disease progression. The diabetes dataset has been loaded as the DataFrame dia.

Later in the course, you'll use a simulation to measure the impact of predictors on responses, so an initial understanding of how these variables relate to each other will help you interpret your simulation results.

The following libraries have been imported for you: pandas as pd, numpy as np, matplotlib.pyplot as plt, and seaborn as sns.

This exercise is part of the course

Monte Carlo Simulations in Python

Exercise instructions

  • Use sns.pairplot() to visually examine tc, hdl, and y in dia.
  • Measure the correlation coefficients between tc, hdl, and y in dia.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Create a pairplot of tc, hdl, and y
____
plt.show()

# Calculate correlation coefficients
print(____)