Exercise

Split into train and test

Now that we have a dataframe, we can apply standard techniques for modeling. In this exercise, you will split the data into training and test sets.

Instructions

  • To ensure the reproducibility of your results, set a seed to 7, using set.seed().
  • Use the sample() function to draw two-thirds of the row numbers of studentnetworkdata, that is, from the sequence running from 1 to the total number of rows. Name this vector index_train.
  • Create the training set by including the rows of studentnetworkdata that are stored in index_train and name it training_set.
  • Create the test set by excluding the rows of studentnetworkdata that are stored in index_train and name it test_set.
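
The steps above can be sketched as follows. This is a minimal illustration rather than the official solution: the real studentnetworkdata is not shown here, so the code builds a small hypothetical stand-in data frame (the id and feature columns are assumptions), and it rounds the sample size to a whole number, which is a choice made for this sketch.

```r
# Hypothetical stand-in for studentnetworkdata; the real columns are not
# shown in the exercise, so `id` and `feature` are assumptions.
studentnetworkdata <- data.frame(
  id = 1:30,
  feature = (1:30)^2
)

# Fix the random number generator so the split is reproducible.
set.seed(7)

# Sample two-thirds of the row numbers (without replacement, the default).
n <- nrow(studentnetworkdata)
index_train <- sample(1:n, round(2 / 3 * n))

# Rows listed in index_train form the training set;
# negative indexing drops those same rows to form the test set.
training_set <- studentnetworkdata[index_train, ]
test_set <- studentnetworkdata[-index_train, ]
```

Because the test set is built by excluding index_train, the two sets never share a row and together cover the whole data frame.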