Create a time series of air quality data
You have seen in the video how to deal with dates that are not in the correct format, but instead are provided as string types, represented as dtype object in pandas.
We have prepared a data set with air quality data (ozone, pm25, and carbon monoxide for NYC, 2000-2017) for you to practice the use of pd.to_datetime().
This exercise is part of the course
Manipulating Time Series Data in Python
Exercise instructions
We have already imported pandas as pd and matplotlib.pyplot as plt for you, and loaded the air quality DataFrame into the variable data.
- Inspect
datausing.info(). - Use
pd.to_datetimeto convert thecolumn'date'todtypedatetime64. - Set the
'date'columnasindex. - Validate the changes by inspecting
datausing.info()again. - Plot
datausingsubplots=True.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
data = pd.read_csv('nyc.csv')
# Inspect data
print(____)
# Convert the date column to datetime64
# Set date column as index
# Inspect data
print(____)
# Plot data