Load the dataset
NannyML comes with a set of internal datasets in order to make it easier to demo use cases and test different algorithms. To load the dataset, you only need to use the nannyml.load_us_census_ma_employment_data() function.
The function returns three pandas DataFrame objects: the reference set (the test set), the analysis set (unseen production data), and the ground truth for the analysis set. By convention, these data frames are named reference, analysis, and analysis_gt, in the order in which the function returns them.
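The three-way unpacking described above can be sketched as follows. This is only an illustration: load_and_preview is a hypothetical helper that is not part of NannyML, and the sketch assumes the loader returns the frames in the order reference, analysis, analysis_gt. The call into nannyml runs only if the library is installed.

```python
def load_and_preview(loader, n_rows=5):
    """Hypothetical helper: unpack the three frames a loader returns and preview each."""
    # Assumption: the loader returns (reference, analysis, analysis_gt), in that order.
    reference, analysis, analysis_gt = loader()
    # .head(n_rows) returns the first n_rows rows of a pandas DataFrame.
    return {
        "reference": reference.head(n_rows),
        "analysis": analysis.head(n_rows),
        "analysis_gt": analysis_gt.head(n_rows),
    }


try:
    import nannyml  # only available if nannyml is installed
except ImportError:
    nannyml = None

if nannyml is not None:
    # Pass the loader function itself; the helper calls it.
    previews = load_and_preview(nannyml.load_us_census_ma_employment_data)
    for name, head in previews.items():
        print(name)
        print(head)
```

Keeping the three frames under their conventional names makes it easy to tell later which data has known targets (reference), which is being monitored (analysis), and where the delayed ground truth lives (analysis_gt).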
In this exercise, you will load the US Census Employment dataset and print the data frames to understand what they look like.
This exercise is part of the course Monitoring Machine Learning in Python.
Exercise instructions
- Import the nannyml library.
- Load the US Census Employment dataset from the nannyml library.
- Print the head of the reference data.
- Print the head of the analysis data.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Import nannyml
import ____
# Load US Census Employment dataset
____, ____, ____ = ____.____()
# Print head of the reference data
____
# Print head of the analysis data
____