CommencerCommencer gratuitement

Inference on coefficients

Using the NYC Italian restaurants dataset (compiled by Simon Sheather in A Modern Approach to Regression with R), restNYC, you will investigate the effect on the significance of the coefficients when there are multiple variables in the model. Recall, the p-value associated with any coefficient is the probability of the observed data given that the particular variable is independent of the response AND given that all other variables are included in the model.

The following information relates to the dataset restNYC which is loaded into your workspace:

  • each row represents one customer survey from Italian restaurants in NYC
  • Price = price (in US$) of dinner (including tip and one drink)
  • Service = rating of the service (from 1 to 30)
  • Food = rating of the food (from 1 to 30)
  • Decor = rating of the decor (from 1 to 30)

Cet exercice fait partie du cours

Inference for Linear Regression in R

Afficher le cours

Instructions

  • Run a tidy lm regressing Price on Service.
  • Run a tidy lm regressing Price on Service, Food, and Decor.
  • What happened to the significance of Service when additional variables were added to the model?

Exercice interactif pratique

Essayez cet exercice en complétant cet exemple de code.

# Output the first model


# Output the second model
Modifier et exécuter le code