Statistical inference with Maryland crime data

1. Statistical inference with Maryland crime data

Applying mixed-effect models requires the use of statistical inferences. During this lesson, we will talk about two different methods with mixed effect models: Null-hypothesis testing of covariates and applying analysis of variance or ANOVAs to compare models. We'll also introduce a new dataset, the Maryland crime dataset.

2. Maryland crime data

Maryland provides their annual crime data on data dot gov. Their data include the number of violent crimes per county. Policy analysts, academics, and private sector interests, such as insurance companies, are interested in how these numbers change over time. These numbers are reported on the county-level. Each county can, and likely does, have its own changes over time. We will be exploring this data and comparing different methods for statistical inferences using it.

3. Null hypothesis test

Most introductory statistics courses cover null hypothesis testing as part of frequentist statistics. This approach compares a model or parameters within a model to a straw man hypothesis in that no effect occurs. These can be used within mixed-effect models to estimate if parameters vary due to chance alone. You will see how to view the output from lmer models in the exercises.

4. P-values with lmer

By default, the lme4 package does not include p-values. There are several reasons for this. First, p-values cannot be estimated for the random-effects because these are latent variables without standard deviations. Second, estimating p-values for fixed-effects within a mixed-effect model is currently an open research question, which includes an on-going debate on the best practices for how to calculate degrees of freedom. However, several ad-hoc packages do exist. One such package is lmerTest. We'll use this package in our exercises.

5. ANOVA

Analysis of variance or ANOVA is a powerful statistical tool. Usually, it is used to compare variance within and between groups to see if the groups differ from each other. ANOVA can also be used to compare mixed-effect models. When applying ANOVAs to mixed-effect models, we compare the variability explained by one model to the variability explained by another model. The model that best explains the variability is the one we use. For example, if we wondered whether a specific response variable was important, we could build two models, one with the parameter and one without. The model that the ANOVA said did a better job of explaining the variability would be the model we want to use with our data.

6. Summary

We've gone over two methods for comparing mixed-effect models. These methods allow us to build and compare models as well as determine what predictor variables are important. I've only provided a high-level overview of these approaches.

7. Let's practice!

Now, let's use these methods to compare mixed-effect models!

This exercise is part of the course

Hierarchical and Mixed Effects Models in R

AdvancedSkill Level

4.6+

Start Course for Free

The first chapter provides an example of when to use a mixed-effect and also describes the parts of a regression. The chapter also examines a student test-score dataset with a nested structure to demonstrate mixed-effects.

Exercise 1: What is a hierarchical model?Exercise 2: Examples of hierarchical datasets Exercise 3: Multi-level student data Exercise 4: Exploring multiple-levels: Classrooms and schools Exercise 5: Parts of a regression Exercise 6: Intercepts Exercise 7: Slopes and multiple regression Exercise 8: Random-effects in regressions with school data Exercise 9: Random-effect intercepts Exercise 10: Random-effect slopes Exercise 11: Building the school model Exercise 12: Interpreting the school model

This chapter providers an introduction to linear mixed-effects models. It covers different types of random-effects, describes how to understand the results for linear mixed-effects models, and goes over different methods for statistical inference with mixed-effects models using crime data from Maryland.

Exercise 1: Linear mixed effect model- Birth rates data Exercise 2: Building a lmer model with random effects Exercise 3: Including a fixed effect Exercise 4: Random-effect slopes Exercise 5: Uncorrelated random-effect slope Exercise 6: Fixed- and random-effect predictor Exercise 7: Understanding and reporting the outputs of a lmer Exercise 8: Comparing print and summary output Exercise 9: Extracting coefficients Exercise 10: Displaying the results from a lmer model Exercise 11: Statistical inference with Maryland crime data

Current Exercise

Exercise 12: Visualizing Maryland crime data Exercise 13: Rescaling slopes Exercise 14: Null hypothesis testing Exercise 15: Controversies around P-values Exercise 16: Model comparison with ANOVA

This chapter extends linear mixed-effects models to include non-normal error terms using generalized linear mixed-effects models. By altering the model to include a non-normal error term, you are able to model more kinds of data with non-linear responses. After reviewing generalized linear models, the chapter examines binomial data and count data in the context of mixed-effects models.

Exercise 1: Crash course on GLMs Exercise 2: Logistic regression Exercise 3: Poisson Regression Exercise 4: Plotting GLMs Exercise 5: Binomial data Exercise 6: Toxicology data Exercise 7: Marketing example Exercise 8: Calculating odds-ratios Exercise 9: Count data Exercise 10: Internet click-throughs Exercise 11: Chlamydia by age-group and county Exercise 12: Displaying chlamydia results

This chapter shows how repeated-measures analysis is a special case of mixed-effect modeling. The chapter begins by reviewing paired t-tests and repeated measures ANOVA. Next, the chapter uses a linear mixed-effect model to examine sleep study data. Lastly, the chapter uses a generalized linear mixed-effect model to examine hate crime data from New York state through time.

Exercise 1: An introduction to repeated measures Exercise 2: Paired t-test Exercise 3: Repeated measures ANOVA Exercise 4: Sleep study Exercise 5: Exploring the data Exercise 6: Building models Exercise 7: Comparing regressions and ANOVAs Exercise 8: Plotting results Exercise 9: Hate in NY state?Exercise 10: Exploring NY hate data Exercise 11: Building the model Exercise 12: Interpreting model results Exercise 13: Displaying the results Exercise 14: Hierarchical models in R review Exercise 15: Conclusion