LoslegenKostenlos loslegen

Saving preprocessed datasets

As part of your customer service chatbot project, you now have prepared a dataset for fine-tuning a Llama model. The next step is to save the dataset so that you can reload it later without having to repeat the preprocessing steps. This will allow your team to reuse the dataset across multiple experiments and iterations.

Diese Übung ist Teil des Kurses

Fine-Tuning with Llama 3

Kurs anzeigen

Anleitung zur Übung

  • Save the preprocessed dataset ds to disk.
  • Load the saved dataset into a new variable ds_preprocessed.
  • Print the first element of ds_preprocessed.

Interaktive Übung

Versuche dich an dieser Übung, indem du diesen Beispielcode vervollständigst.

from datasets import load_from_disk

# Save the dataset to disk
____

# Load the dataset from disk
ds_preprocessed = ____

# Print the first element of the loaded dataset
print(____)
Code bearbeiten und ausführen