Exercise

Looking at pigeon data

In this second chapter you will analyze some truly unique data sets.

The American Racing Pigeon Union has a great database of race results. A subset of this data is available in your workspace as the data frame pigeon.

First, you'll investigate whether flying speed is related to the color of the pigeons' feathers. Next, you'll use ggplot2 to plot the pigeons' speed against their position (Pos).

Instructions
100 XP
  • Have a look at the chained set of dplyr functions. The first part retains pigeons with a Color that appears at least 10 times in the dataset. Finish the final group_by() and summarize() calls to calculate the average Speed, grouped per Color. The resulting data frame should have two columns: Color and AvgSpeed. Simply print out this data frame.
  • Finish the ggplot() command:
    • Map Pos onto x and Speed onto y.
    • Label the x-axis with "Rank" and the y-axis with "Pigeon Speed".
    • Finish off by adding the title "The correlation between speed and rank".
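
The steps above can be sketched roughly as follows. This is only one possible shape for the code, not the exercise's official solution: the real pigeon data frame lives in the course workspace, so the sketch builds a small made-up stand-in called pigeon_demo (a hypothetical name, with invented values) that has the same three columns. The group_by()/filter(n() >= 10) step is an assumption about how the "at least 10 times" filter could be written, and geom_point() is an assumed choice of geom, since the instructions do not name one.

```r
library(dplyr)
library(ggplot2)

# Hypothetical stand-in for `pigeon`; the values are invented so that the
# sketch runs on its own. Two colors appear at least 10 times, one does not.
pigeon_demo <- data.frame(
  Pos   = 1:30,
  Color = rep(c("BB", "BC", "RED"), times = c(14, 12, 4)),
  Speed = seq(from = 170, to = 112, length.out = 30)
)

# Keep only colors with at least 10 pigeons, then average Speed per Color.
avg_speed <- pigeon_demo %>%
  group_by(Color) %>%
  filter(n() >= 10) %>%
  ungroup() %>%
  group_by(Color) %>%
  summarize(AvgSpeed = mean(Speed))

# Print the two-column result (Color, AvgSpeed).
print(avg_speed)

# Scatter plot of Speed against Pos with the requested labels and title.
p <- ggplot(pigeon_demo, aes(x = Pos, y = Speed)) +
  geom_point() +
  labs(
    x = "Rank",
    y = "Pigeon Speed",
    title = "The correlation between speed and rank"
  )
p
```

In the exercise itself you would apply the same calls to pigeon instead of pigeon_demo. The "RED" group is dropped by the filter because it has only four rows, which is why the summary contains two colors rather than three.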