Get startedGet started for free

Simulating Data with Purrr

Often when trying to solve a problem with data we first need to build some simulated data to see if our idea is even possible. For example, you may want to test models with data that have known differences, to see if the models are working correctly.

In this exercise, you will see how this works in purrr by simulating data for two populations, a and b, from the sites: "north", "east", and "west". The two populations will be randomly drawn from a normal distribution, with different means and standard deviations.

This exercise is part of the course

Foundations of Functional Programming with purrr

View Course

Exercise instructions

  • Create a list of site names, "north", "east", and "west".
  • Then use map() to create a list of data frames with three columns, the first column is sites.
    • The second is population a, which has a mean of 5, a sample size n of 200, and an sd of (5/2).
    • The third is population b, which has a mean of 200, a sample size n of 200, and an sd of 15.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# List of sites north, east, and west
sites <- list(___)

# Create a list of data frames, each with a years, a, and b column
list_of_df <-  map(___,  
  ~___(___ = .x,
       a = rnorm(mean = ___, n = ___, sd = ___),
       b = rnorm(mean = ___, n = ___, sd = ___)))

list_of_df
Edit and Run Code