Embedding in PyTorch

PyBooks had succes met een boekaanbevelingssysteem. Maar het houdt geen rekening met sommige semantiek in de tekst. De ingebouwde embeddinglaag van PyTorch kan de relatie tussen woorden rechtstreeks uit data leren en weergeven. Je team is benieuwd naar deze mogelijkheid om het aanbevelingssysteem te verbeteren. Kun jij helpen dit te implementeren?

torch en torch.nn als nn zijn al voor je geïmporteerd.

Deze oefening maakt deel uit van de cursus

Deep Learning voor tekst met PyTorch

Cursus bekijken

Oefeninstructies

Koppel een unieke index aan elk woord in words en sla dit op in word_to_idx.
Zet word_to_idx om naar een PyTorch-tensor en sla dit op in inputs.
InitialiseER een embeddinglaag met het torch-module met tien dimensies.
Geef de tensor inputs door aan de embeddinglaag en bekijk de output.

Praktische interactieve oefening

Probeer deze oefening eens door deze voorbeeldcode in te vullen.

# Map a unique index to each word
words = ["This", "book", "was", "fantastic", "I", "really", "love", "science", "fiction", "but", "the", "protagonist", "was", "rude", "sometimes"]
word_to_idx = {word: ____ for i, word in enumerate(____)}

# Convert word_to_idx to a tensor
inputs = ____.____([word_to_idx[w] for w in words])

# Initialize embedding layer with ten dimensions
embedding = nn.____(num_embeddings=len(words), embedding_dim=____)

# Pass the tensor to the embedding layer
output = embedding(____)
print(output)

Code bewerken en uitvoeren

Deze oefening maakt deel uit van de cursus

Deep Learning voor tekst met PyTorch

SkillTag.level.advancedSkillTag.label

4.8+

Begin de cursus gratis

This chapter introduces you to deep learning for text and its applications. Learn how to use PyTorch for text processing and get hands-on experience with techniques such as tokenization, stemming, stopword removal, and more. Understand the importance of encoding text data and implement encoding techniques using PyTorch. Finally, consolidate your knowledge by building a text processing pipeline combining these techniques.

Exercise 1: Introduction to preprocessing for text Exercise 2: Word frequency analysis Exercise 3: Preprocessing text Exercise 4: Encoding text data Exercise 5: One-hot encoded book titles Exercise 6: Bag-of-words for book titles Exercise 7: Applying TF-IDF to book descriptions Exercise 8: Introduction to building a text processing pipeline Exercise 9: Shakespearean language preprocessing pipeline Exercise 10: Shakespearean language encoder

Explore text classification and its role in Natural Language Processing (NLP). Apply your skills to implement word embeddings and develop both Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) for text classification using PyTorch, and understand how to evaluate your models using suitable metrics.

Exercise 1: Overzicht van tekstclassificatie Exercise 2: Embedding in PyTorch

Huidige oefening

Exercise 3: Tekstclassificatietaken categoriseren Exercise 4: Convolutionele neurale netwerken voor tekstclassificatie Exercise 5: Bouw een CNN-model voor tekst Exercise 6: Train een CNN-model voor tekst Exercise 7: Het Sentimentanalyse-CNN-model testen Exercise 8: Recurrent neural networks voor tekstclassificatie Exercise 9: Een RNN-model voor tekst bouwen Exercise 10: Een LSTM-model voor tekst bouwen Exercise 11: Een GRU-model bouwen voor tekst Exercise 12: Evaluatiemetrics voor tekstclassificatie Exercise 13: RNN-classificatiemodellen evalueren Exercise 14: De prestaties van het model evalueren Exercise 15: Modellen vergelijken

Venture into the exciting world of text generation and its applications in NLP. Understand how to leverage Recurrent Neural Networks (RNNs), Generative Adversarial Networks (GANs), and pre-trained models for text generation tasks using PyTorch. Alongside, you'll learn to evaluate the performance of your models using relevant metrics.

Exercise 1: Introduction to text generation Exercise 2: Creating a RNN model for text generation Exercise 3: Text generation using RNN - Training and Generation Exercise 4: Generative adversarial networks for text generation Exercise 5: Building a generator and discriminator Exercise 6: Training a GAN model Exercise 7: Pre-trained models for text generation Exercise 8: Text completion with pre-trained GPT-2 models Exercise 9: Language translation with pretrained PyTorch model Exercise 10: Evaluation metrics for text generation Exercise 11: Evaluating pretrained text generation model Exercise 12: Understanding text generation metrics

Understand the concept of transfer learning and its application in text classification. Explore Transformers, their architecture, and how to use them for text classification and generation tasks. You will also delve into attention mechanisms and their role in text processing. Finally, understand the potential impacts of adversarial attacks on text classification models and learn how to protect your models.

Exercise 1: Transfer learning for text classification Exercise 2: Transfer learning using BERT Exercise 3: Evaluating the BERT model Exercise 4: Transformers for text processing Exercise 5: Creating a transformer model Exercise 6: Training and testing the Transformer model Exercise 7: Attention mechanisms for text processing Exercise 8: Creating a RNN model with attention Exercise 9: Training and testing the RNN model with attention Exercise 10: Adversarial attacks on text classification models Exercise 11: Adversarial attack classification Exercise 12: Safeguarding AI at PyBooks Exercise 13: Wrap-up