ComenzarEmpieza gratis

Train variation

Time to work with US monthly train ridership data! Let's begin by exploring the variation in monthly train ridership. To understand a dataset beyond averages, the variance is an extremely useful statistic. As the name suggests, it gives a sense for the variation that exists in the data. That is, how far from the mean each point is.

As a reminder, to calculate the variance of a population:

  1. Calculate the mean of the entire dataset.
  2. Subtract each value from the mean.
  3. Square the differences to ensure positive and negative values don't cancel each other out.
  4. Take the average of the squared differences.

To fully understand variance, in this exercise you will first follow the above steps to calculate the variance manually and then use the VARP() function to automatically calculate variance.

Este ejercicio forma parte del curso

Introduction to Statistics in Google Sheets

Ver curso

Instrucciones del ejercicio

  • In cell D2, calculate the difference between B2 and the mean train ridership (AVERAGE($B$2:$B$160)). Do the same for the rest of the column.
  • In column E, square each of the differences in column D using ^2.
  • Calculate the variance by calculating the mean of E2:E160 in F2.
  • Use VARP() on B2:B160 to concisely calculate the variance.

Ejercicio interactivo práctico

Pon en práctica la teoría con uno de nuestros ejercicios interactivos

Empieza el ejercicio