ComeçarComece de graça

Train variation

Time to work with US monthly train ridership data! Let's begin by exploring the variation in monthly train ridership. To understand a dataset beyond averages, the variance is an extremely useful statistic. As the name suggests, it gives a sense for the variation that exists in the data. That is, how far from the mean each point is.

As a reminder, to calculate the variance of a population:

  1. Calculate the mean of the entire dataset.
  2. Subtract each value from the mean.
  3. Square the differences to ensure positive and negative values don't cancel each other out.
  4. Take the average of the squared differences.

To fully understand variance, in this exercise you will first follow the above steps to calculate the variance manually and then use the VARP() function to automatically calculate variance.

Este exercício faz parte do curso

Introduction to Statistics in Google Sheets

Ver curso

Instruções do exercício

  • In cell D2, calculate the difference between B2 and the mean train ridership (AVERAGE($B$2:$B$160)). Do the same for the rest of the column.
  • In column E, square each of the differences in column D using ^2.
  • Calculate the variance by calculating the mean of E2:E160 in F2.
  • Use VARP() on B2:B160 to concisely calculate the variance.

Exercício interativo prático

Transforme a teoria em ação com um de nossos exercícios interativos

Começar o exercício