Examining y, tc, and cdl
Now, you'll focus on tc, hdl, and y, to understand the relationship between a few independent variables and the dependent one, disease progression. The diabetes dataset has been loaded as the DataFrame dia.
Later in the course, you'll use a simulation to measure the impact of predictors on responses, so having an initial understanding of relationships between variables will help you understand your simulation results later on!
The following libraries have been imported for you: pandas as pd, numpy as np, matplotlib.pyplot as plt, and seaborn as sns.
Deze oefening maakt deel uit van de cursus
Monte Carlo Simulations in Python
Oefeninstructies
- Use
.pairplot()to visually examinetc,hdl, andyindia. - Measure the correlation coefficients between
tc,hdl, andyindia.
Praktische interactieve oefening
Probeer deze oefening eens door deze voorbeeldcode in te vullen.
# Create a pairplot of tc, hdl, and y
____
plt.show()
# Calculate correlation coefficients
print(____)