Visualizing messy data
Let's take a look at a new dataset - this one is a bit less-clean than what you've seen before.
As always, you'll first start by visualizing the raw data. Take a close look and try to find datapoints that could be problematic for fitting models.
The data has been loaded into a DataFrame called prices.
Deze oefening maakt deel uit van de cursus
Machine Learning for Time Series Data in Python
Oefeninstructies
- Visualize the time series data using Pandas.
- Calculate the number of missing values in each time series. Note any irregularities that you can see. What do you think they are?
Praktische interactieve oefening
Probeer deze oefening eens door deze voorbeeldcode in te vullen.
# Visualize the dataset
prices.____(legend=False)
plt.tight_layout()
plt.show()
# Count the missing values of each time series
missing_values = prices.____.____
print(missing_values)