Simulating Data with Purrr
Often when trying to solve a problem with data we first need to build some simulated data to see if our idea is even possible. For example, you may want to test models with data that have known differences, to see if the models are working correctly.
In this exercise, you will see how this works in purrr
by simulating data for two populations, a
and b
, from the sites: "north", "east", and "west". The two populations will be randomly drawn from a normal distribution, with different means and standard deviations.
This exercise is part of the course
Foundations of Functional Programming with purrr
Exercise instructions
- Create a list of site names, "north", "east", and "west".
- Then use
map()
to create a list of data frames with three columns, the first column issites
.- The second is population
a
, which has amean
of 5, a sample sizen
of 200, and ansd
of (5/2). - The third is population
b
, which has amean
of 200, a sample sizen
of 200, and ansd
of 15.
- The second is population
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# List of sites north, east, and west
sites <- list(___)
# Create a list of data frames, each with a years, a, and b column
list_of_df <- map(___,
~___(___ = .x,
a = rnorm(mean = ___, n = ___, sd = ___),
b = rnorm(mean = ___, n = ___, sd = ___)))
list_of_df