LSTM network

As you already know, plain RNN cells are rarely used in practice. A more commonly used alternative that handles long sequences much better is the Long Short-Term Memory (LSTM) cell. In this exercise, you will build an LSTM network yourself!

The most important implementation difference from the RNN network you built previously is that an LSTM has two hidden states instead of one: the short-term hidden state h and the long-term memory (cell) state c. This means you need to initialize the additional state and pass it to the LSTM layer together with the first one.
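To make the "two states instead of one" point concrete, here is a minimal sketch comparing what nn.RNN and nn.LSTM return. The batch size, sequence length, and variable names are illustrative assumptions, not values from the exercise.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Illustrative sizes (assumptions): 4 sequences, 10 time steps, 1 feature each.
batch_size, seq_len, n_features, hidden_size = 4, 10, 1, 32
x = torch.randn(batch_size, seq_len, n_features)

rnn = nn.RNN(input_size=n_features, hidden_size=hidden_size, batch_first=True)
lstm = nn.LSTM(input_size=n_features, hidden_size=hidden_size, batch_first=True)

# A plain RNN returns its outputs and a single final hidden state.
rnn_out, rnn_h_n = rnn(x)

# An LSTM returns its outputs and a tuple of two final states:
# the hidden state h_n and the cell (long-term memory) state c_n.
lstm_out, (lstm_h_n, lstm_c_n) = lstm(x)

print(rnn_out.shape, rnn_h_n.shape)
print(lstm_out.shape, lstm_h_n.shape, lstm_c_n.shape)
```

Both final states have shape (num_layers, batch, hidden_size), which is also the shape the initial states must have when you pass them in.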

torch and torch.nn (as nn) have already been imported for you, so let's get coding!

This exercise is part of the course

Intermediate Deep Learning with PyTorch

Exercise instructions

In the .__init__() method, define an LSTM layer and assign it to self.lstm.
In the forward() method, initialize the first long-term memory state c0 with zeros, using the same shape as h0.
In the forward() method, pass all three inputs to the LSTM layer: the input x, and a tuple containing the two hidden states, h0 and c0.
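The state-passing step can be sketched in isolation as follows. The layer sizes mirror the exercise template; the dummy input shape is an assumption. The sketch also checks that passing explicit zero states gives the same output as omitting them, since nn.LSTM falls back to zero states when none are provided.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

num_layers, hidden_size = 2, 32
lstm = nn.LSTM(
    input_size=1,
    hidden_size=hidden_size,
    num_layers=num_layers,
    batch_first=True,
)

# Dummy batch (assumed shape): 8 sequences, 24 time steps, 1 feature.
x = torch.randn(8, 24, 1)

# Both states share the shape (num_layers, batch_size, hidden_size).
h0 = torch.zeros(num_layers, x.size(0), hidden_size)
c0 = torch.zeros(num_layers, x.size(0), hidden_size)

# The two states are passed together as one tuple, after the input.
out_explicit, _ = lstm(x, (h0, c0))

# Without the tuple, nn.LSTM initializes both states to zeros itself.
out_default, _ = lstm(x)

same = torch.allclose(out_explicit, out_default)
print(same)
```

Writing the zero states out explicitly is therefore mostly about making the mechanics visible; it becomes necessary once you want to start from non-zero states.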

Hands-on interactive exercise

Try this exercise by completing the sample code below.

class Net(nn.Module):
    def __init__(self, input_size):
        super().__init__()
        # Define lstm layer
        ____ = ____(
            input_size=1,
            hidden_size=32,
            num_layers=2,
            batch_first=True,
        )
        self.fc = nn.Linear(32, 1)

    def forward(self, x):
        h0 = torch.zeros(2, x.size(0), 32)
        # Initialize long-term memory
        c0 = ____
        # Pass all inputs to lstm layer
        out, _ = ____
        out = self.fc(out[:, -1, :])
        return out
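For reference, one possible completion of the template is sketched below. It is not the course's official solution. Unlike the template, which hard-codes input_size=1, this sketch passes the constructor's input_size argument through to the layer; the dummy batch at the end is an assumption used only to check shapes.

```python
import torch
import torch.nn as nn


class Net(nn.Module):
    def __init__(self, input_size):
        super().__init__()
        # LSTM layer: 2 stacked layers with 32 hidden units each.
        self.lstm = nn.LSTM(
            input_size=input_size,
            hidden_size=32,
            num_layers=2,
            batch_first=True,
        )
        self.fc = nn.Linear(32, 1)

    def forward(self, x):
        # Short-term hidden state, shape (num_layers, batch, hidden_size).
        h0 = torch.zeros(2, x.size(0), 32)
        # Long-term memory (cell) state, same shape as h0.
        c0 = torch.zeros(2, x.size(0), 32)
        # Pass the input and the tuple of both states to the LSTM layer.
        out, _ = self.lstm(x, (h0, c0))
        # Use only the output at the last time step for the prediction.
        out = self.fc(out[:, -1, :])
        return out


# Shape check on a dummy batch (assumed): 16 sequences, 96 steps, 1 feature.
net = Net(input_size=1)
dummy = torch.randn(16, 96, 1)
pred = net(dummy)
print(pred.shape)
```

The states are created on the CPU here, as in the template; if the model runs on another device, creating them with device=x.device would keep them alongside the input.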

