PyTorch Transformers
Now you're familiar with the different components of the transformer architecture, it's time to define one! The torch.nn
module, imported for you as nn
, provides a really nice way to do this in just a few lines of code.
Cet exercice fait partie du cours
Transformer Models with PyTorch
Instructions
- Define a transformer with
8
attention heads,6
encoder and decoder layers, and for input sequence embeddings of length1536
. - Print the model object to view the model architecture.
Exercice interactif pratique
Essayez cet exercice en complétant cet exemple de code.
# Define the transformer model
model = ____
# Print the model object
print(____)