Weighted sampling
Stratified sampling provides rules about the probability of picking rows from your dataset at the subgroup level. A generalization of this is weighted sampling, which lets you specify rules about the probability of picking rows at the row level. The probability of picking any given row is proportional to the weight value for that row.
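As a minimal sketch of the idea, the snippet below draws a weighted sample with the pandas `DataFrame.sample()` method and its `weights` argument. The toy DataFrame and its column names (`employee`, `years`) are hypothetical stand-ins, not part of the exercise data, and sampling with replacement is used only so the observed proportions are easy to compare with the weights.

```python
import pandas as pd

# Hypothetical toy data (not the exercise dataset)
df = pd.DataFrame({
    "employee": ["a", "b", "c", "d"],
    "years": [0, 1, 3, 6],
})

# Each row's chance of being drawn is proportional to its "years" value.
# pandas normalizes the weights to sum to 1; a weight of 0 means the row
# is never picked.
sample = df.sample(n=1000, replace=True, weights="years", random_state=42)

# Share of the sample contributed by each row
proportions = sample["employee"].value_counts(normalize=True)
print(proportions)
```

With weights of 0, 1, 3, and 6, row "a" should never appear, and rows "b", "c", and "d" should make up roughly 10%, 30%, and 60% of the sample.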
attrition_pop is available; pandas, matplotlib.pyplot, and numpy are loaded with their usual aliases.
This exercise is part of the course Sampling in Python.
Have a go at this exercise by completing this sample code.
# Plot YearsAtCompany from attrition_pop as a histogram
____
plt.show()