Loading datasets
Hugging Face built the datasets package for interacting with datasets. It provides many convenient functions, including load_dataset_builder(), which you just used. After inspecting a dataset to ensure it suits your project, it's time to load it!
For this, you can pass arguments to load_dataset() to specify which parts of the dataset to load. In this case, you'll load the "test" split of the "TIGER-Lab/MMLU-Pro" dataset, a benchmark evaluation dataset you inspected in the previous exercise.
The load_dataset() function from the datasets package is already imported for you.
This exercise is part of the course Working with Hugging Face.
Exercise instructions
- Use the correct function to load the "TIGER-Lab/MMLU-Pro" dataset and specify the "test" split.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Load the "test" split of the TIGER-Lab/MMLU-Pro dataset
my_dataset = ____("____", split="____")
# Display dataset details
print(my_dataset)
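
For reference, the pattern this exercise practices can be sketched outside the course environment as a small helper. This is a minimal sketch, not the course's solution code: the helper name load_split is hypothetical, and it assumes the datasets package is installed and the Hugging Face Hub is reachable when the helper is called.

```python
def load_split(repo_id: str, split: str = "test"):
    """Load a single split of a dataset hosted on the Hugging Face Hub.

    `load_split` is a hypothetical helper name used only for illustration.
    """
    # Imported inside the function so the helper can be defined even where
    # the `datasets` package is not installed; the import runs on first call.
    from datasets import load_dataset

    # Passing `split` makes load_dataset return that one split as a Dataset;
    # omitting it returns a DatasetDict keyed by split name.
    return load_dataset(repo_id, split=split)
```

Calling load_split("TIGER-Lab/MMLU-Pro") would download the data on first use and return the "test" split, which you can then print to see its features and row count.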