Dask arrays from Zarr datasets
You are tasked with analyzing European temperatures, and are given the same dataset which was in era_eu.hdf but this time in Zarr format. Zarr is a modern, powerful dataset format for storing chunked data. It is particularly good for use on cloud computing services but is also great on your own computer.
dask.array has been imported for you as da.
Deze oefening maakt deel uit van de cursus
Parallel Programming with Dask in Python
Oefeninstructies
- Using the
from_zarr()function from thedask.arraysubpackage, load in the variabletempfrom thedata/era_eu.zarrdataset. - Print the array to see the chunk sizes.
- Find the minimum temperature across all time and space axes.
- Compute the answer.
Praktische interactieve oefening
Probeer deze oefening eens door deze voorbeeldcode in te vullen.
# Load the temperature data from the Zarr dataset
temps = da.____(____)
# Print the Dask array of temperatures to see the chunk sizes
print(____)
# Find the minimum of the mean monthly temperatures
all_time_low = ____
# Compute the answer
all_time_low_value = ____
print(all_time_low_value, "°C")