Foldable operations (II)
Now you'll apply the foldable_range() function to partitions of the data set. Because the operation is performed piece by piece and the partial results are then aggregated, you never need to hold all of the data in a variable at once. This matters little for small data sets like the mortgage sample data, but it does matter for large ones.
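To see why a range can be computed in pieces, here is a minimal sketch on a plain numeric vector. The helper `chunked_range()` is a hypothetical name used only for illustration. It is not the course's foldable_range(), whose definition is not shown here.

```r
# Hypothetical helper (not the course's foldable_range()):
# compute a range per chunk, then fold the partial results together.
chunked_range <- function(chunks) {
  # Range of each chunk, computed independently of the others
  partial <- lapply(chunks, range)
  # Overall minimum of the chunk minima, overall maximum of the chunk maxima
  c(min(sapply(partial, `[`, 1)), max(sapply(partial, `[`, 2)))
}

x <- c(7, 2, 9, 4, 11, 5)
chunks <- split(x, rep(1:3, each = 2))  # three chunks of two values each
result <- chunked_range(chunks)
print(result)
```

The folded result matches `range(x)` on the full vector, yet only one chunk needs to be examined at a time.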
This exercise is part of the course
Scalable Data Processing in R
Exercise instructions
The foldable_range() function is available in your workspace.
- Split the rows of mort by the "year" column.
- Use foldable_range() to get the range of the "record_number" column chunked by year.
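The general pattern of splitting row indices by a column and mapping over the groups can be sketched on a toy data frame. Everything below is an assumption for illustration: `toy` is made-up data standing in for mort (which may be a different kind of object in the course workspace), and base `range()` stands in for foldable_range().

```r
# Made-up stand-in for the mortgage data (assumption, not the real mort)
toy <- data.frame(
  year = c(2008, 2008, 2009, 2009, 2009),
  record_number = c(3, 1, 10, 7, 8)
)

# Group the row indices, not the rows themselves, by year
idx <- split(seq_len(nrow(toy)), toy$year)

# Compute the range of record_number within each year's rows
per_year <- Map(function(s) range(toy[s, "record_number"]), idx)

# Fold the per-year ranges into one overall range
overall <- range(unlist(per_year))
print(per_year)
print(overall)
```

Splitting indices instead of the data keeps each chunk lookup cheap: only the rows for one group are pulled out at a time.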
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Split the mortgage data by year
spl <- ___
# Use foldable_range() to get the range of the record numbers
foldable_range(___(function(s) ___(mort[s, "record_number"]), ___))