Exercise

Dask arrays from Zarr datasets

You are tasked with analyzing European temperatures and are given the same dataset that was in era_eu.hdf, this time in Zarr format. Zarr is a modern format for storing chunked data. It is particularly well suited to cloud computing services, but it also works well on your own computer.

dask.array has been imported for you as da.

Instructions

  • Using the from_zarr() function from the dask.array subpackage, load the temp variable from the data/era_eu.zarr dataset.
  • Print the array to see the chunk sizes.
  • Find the minimum temperature across all time and space axes.
  • Compute the answer.