1. Learn
  2. /
  3. Courses
  4. /
  5. Sampling in Python

Connected

Exercise

Is systematic sampling OK?

Systematic sampling has a problem: if the data has been sorted, or there is some sort of pattern or meaning behind the row order, then the resulting sample may not be representative of the whole population. The problem can be solved by shuffling the rows, but then systematic sampling is equivalent to simple random sampling.

Here you'll look at how to determine whether or not there is a problem.

attrition_pop is available; pandas is loaded as pd, and matplotlib.pyplot as plt.

Instructions 1/3

undefined XP
    1
    2
    3
  • Add an index column to attrition_pop, assigning the result to attrition_pop_id.
  • Create a scatter plot of YearsAtCompany versus index for attrition_pop_id using pandas .plot().