Sequential Dataset
Good job building the create_sequences()
function! It's time to use it to create a training dataset for your model.
Just like tabular and image data, sequential data is easiest passed to a model through a torch Dataset and DataLoader. To build a sequential Dataset, you will call create_sequences()
to get the NumPy arrays with inputs and targets, and inspect their shape. Next, you will pass them to a TensorDataset
to create a proper torch Dataset, and inspect its length.
Your implementation of create_sequences()
and a DataFrame with the training data called train_data
are available.
This exercise is part of the course
Intermediate Deep Learning with PyTorch
Exercise instructions
- Call
create_sequences()
, passing it the training DataFrame and a sequence length of24*4
, assigning the result toX_train, y_train
. - Define
dataset_train
by callingTensorDataset
and passing it two arguments, the inputs and the targets created bycreate_sequences()
, both converted from NumPy arrays to tensors of floats.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
import torch
from torch.utils.data import TensorDataset
# Use create_sequences to create inputs and targets
X_train, y_train = ____
print(X_train.shape, y_train.shape)
# Create TensorDataset
dataset_train = ____(
____,
____,
)
print(len(dataset_train))