Dask arrays from Zarr datasets

You are tasked with analyzing European temperatures and have been given the same dataset that was in era_eu.hdf, this time in Zarr format. Zarr is a modern dataset format for storing chunked data. It is particularly well suited to cloud computing services, but it also works well on your own computer.

dask.array has been imported for you as da.

This exercise is part of the course

Parallel Programming with Dask in Python


Exercise instructions

  • Using the from_zarr() function from the dask.array subpackage, load in the variable temp from the data/era_eu.zarr dataset.
  • Print the array to see the chunk sizes.
  • Find the minimum temperature across all time and space axes.
  • Compute the answer.

Hands-on interactive exercise

Try this exercise by completing the sample code below.

# Load the temperature data from the Zarr dataset
temps = da.____(____)

# Print the Dask array of temperatures to see the chunk sizes
print(____)

# Find the minimum of the mean monthly temperatures
all_time_low = ____

# Compute the answer
all_time_low_value = ____

print(all_time_low_value, "°C")