Examining y, tc, and hdl
Now, you'll focus on tc, hdl, and y to understand the relationship between a few independent variables and the dependent one, disease progression. The diabetes dataset has been loaded as the DataFrame dia.
Later in the course, you'll use a simulation to measure the impact of predictors on responses, so an initial understanding of the relationships between variables will help you interpret your simulation results!
The following libraries have been imported for you: pandas as pd, numpy as np, matplotlib.pyplot as plt, and seaborn as sns.
This exercise is part of the course Monte Carlo Simulations in Python.
Exercise instructions
- Use .pairplot() to visually examine tc, hdl, and y in dia.
- Measure the correlation coefficients between tc, hdl, and y in dia.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Create a pairplot of tc, hdl, and y
____
plt.show()
# Calculate correlation coefficients
print(____)