The effect of multicollinearity
Using the crab
dataset you will analyze the effects of multicollinearity. Recall that multicollinearity can have the following effects:
- Coefficient is not significant, but variable is highly correlated with \(y\).
- Adding/removing a variable significantly changes coefficients.
- Not logical sign of the coefficient.
- Variables have high pairwise correlation.
Este ejercicio forma parte del curso
Generalized Linear Models in Python
Instrucciones del ejercicio
- Import necessary functions from
statsmodels
library for GLMs. - Fit a multivariate logistic regression model with
weight
andwidth
as explanatory variables andy
as the response. - View model results using
print()
function.
Ejercicio interactivo práctico
Prueba este ejercicio y completa el código de muestra.
# Import statsmodels
import ____.____ as sm
from ____.____.____ import glm
# Define model formula
formula = '____ ~ ____'
# Fit GLM
model = glm(____, ____ = ____, ____ = sm.____.____).____
# Print model summary
____(____.____)