Dask arrays from Zarr datasets
You are tasked with analyzing European temperatures, and are given the same dataset which was in era_eu.hdf
but this time in Zarr format. Zarr is a modern, powerful dataset format for storing chunked data. It is particularly good for use on cloud computing services but is also great on your own computer.
dask.array
has been imported for you as da
.
This exercise is part of the course
Parallel Programming with Dask in Python
Exercise instructions
- Using the
from_zarr()
function from thedask.array
subpackage, load in the variabletemp
from thedata/era_eu.zarr
dataset. - Print the array to see the chunk sizes.
- Find the minimum temperature across all time and space axes.
- Compute the answer.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Load the temperature data from the Zarr dataset
temps = da.____(____)
# Print the Dask array of temperatures to see the chunk sizes
print(____)
# Find the minimum of the mean monthly temperatures
all_time_low = ____
# Compute the answer
all_time_low_value = ____
print(all_time_low_value, "°C")