Adding plots to dvc.yaml
In this exercise, you are tasked to fill in the dvc.yaml
file that outlines a model training process.
The files preprocess_dataset.py
and train_and_evaluate.py
are responsible for data preprocessing and model training/evaluation respectively, using weather.csv
from the raw_dataset
folder as input. The output of the model training code is the predictions.csv
file, which includes the predictions and the actual values from the test dataset, and a metrics.json
file that holds structured metrics data. The predictions.csv
file will be utilized to create a confusion matrix plot.
This exercise is part of the course
Introduction to Data Versioning with DVC
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
