We are starting to add more steps into the machine learning workflow. Think about when we implemented upsampling to deal with class imbalance. Which data set did we upsample?