Exercise

Dask dataframe

To practice working with Dask dataframes, we will

  • read in a .csv file containing the diabetes dataset as a Dask dataframe,
  • create a new binary variable from the age column, and
  • compute the means of all variables for the resulting two age groups.

The code in this exercise could easily be adapted to work with a Pandas dataframe instead of a Dask dataframe.
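
To illustrate that claim, here is a rough Pandas version of the same workflow. As before, the rows and the `age > 0` split are made-up assumptions rather than part of the exercise. The main difference is that Pandas evaluates eagerly, so no `.compute()` call is needed.

```python
import io

import pandas as pd

# Hypothetical stand-in for the diabetes .csv file: four made-up rows.
csv_text = (
    "age,bmi,target\n"
    "-0.02,0.01,100\n"
    "0.03,0.05,200\n"
    "0.04,0.03,150\n"
    "-0.01,0.02,90\n"
)

# Read the data eagerly into a Pandas dataframe.
pdf = pd.read_csv(io.StringIO(csv_text))

# Assumption: the same age > 0 split defines the binary variable.
pdf["bin_age"] = (pdf["age"] > 0).astype(int)

# The group means are available immediately, without .compute().
pandas_means = pdf.groupby("bin_age").mean()
print(pandas_means)
```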

Instructions
  • Read in a .csv file using a dask.dataframe method.
  • Compute the column means across the two age groups in bin_age.