1. Learn
  2. /
  3. Courses
  4. /
  5. Pandas Joins for Spreadsheet Users

Exercise

Case study challenge - part 1

It's time to build a usable dataset from start to finish! Your team will be using this data to train a model that predicts the likelihood of a concussion on each play. The first part of your challenge is to create a dataset with new game data from the 2017 and 2018 seasons. You have four data frames loaded:

  • matchups_2017: teams playing in each game for the 2017 season

  • matchups_2018: teams playing in each game for the 2018 season

  • conditions_2017: locations and weather of each game for the 2017 season

  • conditions_2018: locations and weather of each game for the 2018 season

Instructions

100 XP
  • Combine the match-up data.
  • Combine the game conditions data.
  • Create one game-level data frame based on the newly combined data frames.