Aan de slagGa gratis aan de slag

Describe time series data with boxplots

You should always explore the distribution of the variables, and because you are working with time series, you will explore their properties using boxplots and numerical summaries. As a reminder, you can plot data in a DataFrame as boxplots with the command:

df.boxplot(fontsize=6, vert=False)

Notice the introduction of the new parameter vert, which specifies whether to plot the boxplots horizontally or vertically.

Deze oefening maakt deel uit van de cursus

Visualizing Time Series Data in Python

Cursus bekijken

Oefeninstructies

  • Generate a boxplot of all the time series in jobs.
  • Print out a numerical statistical summary of all the time series in jobs.
  • Review the results and print the name of the time series with the highest mean value and with the most variability (i.e., with the highest standard deviation).

Praktische interactieve oefening

Probeer deze oefening eens door deze voorbeeldcode in te vullen.

# Generate a boxplot
____.____(fontsize=6, vert=False)
plt.show()

# Generate numerical summaries
print(____)

# Print the name of the time series with the highest mean
print(____)

# Print the name of the time series with the highest variability
print(____)
Code bewerken en uitvoeren