Get startedGet started for free

Dask arrays from Zarr datasets

You are tasked with analyzing European temperatures, and are given the same dataset which was in era_eu.hdf but this time in Zarr format. Zarr is a modern, powerful dataset format for storing chunked data. It is particularly good for use on cloud computing services but is also great on your own computer.

dask.array has been imported for you as da.

This exercise is part of the course

Parallel Programming with Dask in Python

View Course

Exercise instructions

  • Using the from_zarr() function from the dask.array subpackage, load in the variable temp from the data/era_eu.zarr dataset.
  • Print the array to see the chunk sizes.
  • Find the minimum temperature across all time and space axes.
  • Compute the answer.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Load the temperature data from the Zarr dataset
temps = da.____(____)

# Print the Dask array of temperatures to see the chunk sizes
print(____)

# Find the minimum of the mean monthly temperatures
all_time_low = ____

# Compute the answer
all_time_low_value = ____

print(all_time_low_value, "°C")
Edit and Run Code