CV fine-tuning: dataset prep
In this exercise, you will prepare the Stanford Cars dataset for training. This will involve using the datasets
library to split the dataset and applying the preprocessing transformations. The dataset consists of 8k labeled images of 196 car models:
The dataset has been loaded as dataset
. The transformations have been defined for you as transforms
, and consist of renormalization and type conversion.
This exercise is part of the course
Multi-Modal Models with Hugging Face
Exercise instructions
- Create an 80/20 train/test split from
dataset
using the.train_test_split()
method. - Apply the transformations (
transforms
) to the dataset. - Plot the augmented image from the first set of pixel values in
dataset_transformed
.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Create a train/test split within the HF dataset
dataset = ____(test_size=____, seed=42)
# Apply the augmentations
dataset_transformed = ____
# Plot the augmented image
plt.imshow(dataset_transformed["train"][0]["____"].permute(1, 2, 0))
plt.show()