Variation in Sample Statistics

If we create one sample of size=1000 by drawing that many points from a population, we can then compute a sample statistic, such as the mean: a single value that summarizes the sample.
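As a rough illustration of drawing a single sample and summarizing it, here is a minimal sketch. The exercise's real population is preloaded and not shown, so the normally distributed population below (and its loc, scale, and size values) is an assumption made only for this example.

```python
import numpy as np

np.random.seed(42)  # fixed seed so the sketch is repeatable

# Hypothetical stand-in for the preloaded population (assumption, not the course data)
population = np.random.normal(loc=100.0, scale=10.0, size=100_000)

# Draw one sample of 1000 points and reduce it to a single summary value
sample = np.random.choice(population, size=1000)
sample_mean = sample.mean()

print("Sample size: {}".format(sample.size))
print("Sample mean: {:.2f}".format(sample_mean))
print("Population mean: {:.2f}".format(population.mean()))
```

The sample mean should land close to the population mean, but not exactly on it, which is the variation the rest of the exercise explores.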

If you repeat that sampling process num_samples=100 times, you get 100 samples. Computing the sample statistic, such as the mean, for each of those samples results in a distribution of values of the mean. The goal is then to compute the mean and the standard deviation of those sample means.

Here you will use the preloaded population, num_samples, and num_pts. Note that the means and deviations arrays have been initialized to zero to give you containers to fill inside the for loop.

For each of the num_samples=100 iterations, generate a sample, then compute and store the sample statistics.
For each iteration, create a sample by using np.random.choice() to draw 1000 random points from the population.
For each iteration, use the methods sample.mean() and sample.std() to compute the mean and standard deviation of the sample, and store them in the means and deviations arrays.
For the array of means and the array of deviations, compute both the mean and standard deviation of each, and print the results.
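The steps above can be sketched as follows. This is one possible solution under stated assumptions, not the course's reference answer: the population here is a hypothetical normally distributed array standing in for the preloaded one, and the result names (mean_of_means and so on) are chosen only for this example.

```python
import numpy as np

np.random.seed(42)  # fixed seed so the sketch is repeatable

# Hypothetical stand-ins for the preloaded variables (assumptions)
population = np.random.normal(loc=100.0, scale=10.0, size=100_000)
num_samples = 100
num_pts = 1000

# Containers initialized to zero, one slot per sample
means = np.zeros(num_samples)
deviations = np.zeros(num_samples)

# Draw each sample, then compute and store its statistics
for ns in range(num_samples):
    sample = np.random.choice(population, size=num_pts)
    means[ns] = sample.mean()
    deviations[ns] = sample.std()

# Summarize the distribution of each statistic across the samples
mean_of_means = means.mean()
std_of_means = means.std()
mean_of_stds = deviations.mean()
std_of_stds = deviations.std()

print("Means:  center={:.2f}, spread={:.2f}".format(mean_of_means, std_of_means))
print("Stdevs: center={:.2f}, spread={:.2f}".format(mean_of_stds, std_of_stds))
```

With these assumed values, the spread of the sample means should be much smaller than the spread of the population itself, roughly the population standard deviation divided by the square root of num_pts.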
