Sequential Dataset
Good job building the create_sequences() function! It's time to use it to create a training dataset for your model.
Just like tabular and image data, sequential data is easiest passed to a model through a torch Dataset and DataLoader. To build a sequential Dataset, you will call create_sequences() to get the NumPy arrays with inputs and targets, and inspect their shape. Next, you will pass them to a TensorDataset to create a proper torch Dataset, and inspect its length.
Your implementation of create_sequences() and a DataFrame with the training data called train_data are available.
This exercise is part of the course
Intermediate Deep Learning with PyTorch
Exercise instructions
- Call
create_sequences(), passing it the training DataFrame and a sequence length of24*4, assigning the result toX_train, y_train. - Define
dataset_trainby callingTensorDatasetand passing it two arguments, the inputs and the targets created bycreate_sequences(), both converted from NumPy arrays to tensors of floats.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
import torch
from torch.utils.data import TensorDataset
# Use create_sequences to create inputs and targets
X_train, y_train = ____
print(X_train.shape, y_train.shape)
# Create TensorDataset
dataset_train = ____(
____,
____,
)
print(len(dataset_train))