Adding plots to dvc.yaml
In this exercise, your task is to fill in the dvc.yaml file that defines a model training pipeline.
The scripts preprocess_dataset.py and train_and_evaluate.py handle data preprocessing and model training/evaluation, respectively, using weather.csv from the raw_dataset folder as input. The training code produces two outputs: predictions.csv, which contains the predictions and the actual values from the test dataset, and metrics.json, which holds structured metrics data. The predictions.csv file is then used to create a confusion matrix plot.
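As a rough illustration of what such a file might look like, here is a minimal dvc.yaml sketch. It is not the exercise's reference solution. The script names, raw_dataset/weather.csv, predictions.csv, and metrics.json come from the description above; the stage names, the python3 commands, the processed_dataset/weather.csv path, and the column names predicted_label and true_label are assumptions made for illustration and should be replaced with whatever the actual scripts produce.

```yaml
stages:
  # Stage name is an assumption; any descriptive name works.
  preprocess:
    cmd: python3 preprocess_dataset.py
    deps:
      - preprocess_dataset.py
      - raw_dataset/weather.csv
    outs:
      # Hypothetical output path for the preprocessed data.
      - processed_dataset/weather.csv

  train_and_evaluate:
    cmd: python3 train_and_evaluate.py
    deps:
      - train_and_evaluate.py
      # Hypothetical path; must match the preprocess stage's output.
      - processed_dataset/weather.csv
    outs:
      - predictions.csv
    metrics:
      # cache: false keeps the small metrics file tracked by Git
      # instead of the DVC cache.
      - metrics.json:
          cache: false

# Top-level plots section: render predictions.csv as a confusion matrix.
plots:
  - predictions.csv:
      template: confusion
      # Column names are assumptions; use the headers in predictions.csv.
      x: predicted_label
      y: true_label
```

With a file along these lines, dvc repro would run the stages in dependency order, and dvc plots show would render the confusion matrix from predictions.csv.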
This exercise is part of the course
Introduction to Data Versioning with DVC