Create a time series of air quality data
You have seen in the video how to deal with dates that are not in the correct format, but instead are provided as string
types, represented as dtype
object
in pandas
.
We have prepared a data set with air quality data (ozone, pm25, and carbon monoxide for NYC, 2000-2017) for you to practice the use of pd.to_datetime()
.
This exercise is part of the course
Manipulating Time Series Data in Python
Exercise instructions
We have already imported pandas
as pd
and matplotlib.pyplot
as plt
for you, and loaded the air quality DataFrame
into the variable data
.
- Inspect
data
using.info()
. - Use
pd.to_datetime
to convert thecolumn
'date'
todtype
datetime64
. - Set the
'date'
column
asindex
. - Validate the changes by inspecting
data
using.info()
again. - Plot
data
usingsubplots=True
.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
data = pd.read_csv('nyc.csv')
# Inspect data
print(____)
# Convert the date column to datetime64
# Set date column as index
# Inspect data
print(____)
# Plot data