Preparing the input text
You have seen in the video how to prepare the input and output texts. This exercise demonstrates a common practice: padding all sentences to the length of the longest one, so that no information is lost.
Since RNN models need all inputs to have the same size, padding to the maximum length adds zeros to the shorter sentences without truncating the longer ones.
You will also use words instead of characters to represent the tokens, a common approach for NMT (neural machine translation) models.
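To make the padding behavior concrete, here is a minimal sketch using Keras's pad_sequences on two made-up toy sequences (not the exercise data): maxlen is set to the longest sequence's length, so the longer sequence is kept whole and the shorter one receives trailing zeros.

from tensorflow.keras.preprocessing.sequence import pad_sequences

# Toy sequences of word indexes (illustrative only)
sequences = [[1, 2, 3], [4, 5]]

# maxlen equals the longest sequence, so nothing is cut;
# padding='post' appends zeros to the shorter sequence
padded = pad_sequences(sequences, maxlen=3, padding='post')
print(padded)
# [[1 2 3]
#  [4 5 0]]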
The Portuguese texts are loaded in the pt_sentences variable, and a fitted tokenizer in the input_tokenizer variable.
This exercise is part of the course Recurrent Neural Networks (RNNs) for Language Modeling with Keras.
Exercise instructions
- Use the .split() method on each sentence to split by whitespace and obtain the number of words in the sentence.
- Use the .texts_to_sequences() method to transform the texts into sequences of indexes.
- Use the obtained maximum sentence length to pad the sequences.
- Print the first transformed sentence.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Get maximum length of the sentences
pt_length = max([len(sentence.____) for sentence in pt_sentences])
# Transform text to sequence of numerical indexes
X = input_tokenizer.____(pt_sentences)
# Pad the sequences
X = pad_sequences(X, maxlen=____, padding='post')
# Print first sentence
print(pt_sentences[0])
# Print transformed sentence
print(____)
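For reference, below is one possible completed solution as a self-contained sketch. The two Portuguese sentences and the tokenizer fitting are stand-ins for the pt_sentences and input_tokenizer variables that the exercise preloads; the filled-in calls follow the method names given in the instructions above.

from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences

# Stand-in data: the real exercise preloads Portuguese sentences
pt_sentences = ["eu gosto de aprender", "olá mundo"]

# Stand-in for the fitted tokenizer the exercise provides
input_tokenizer = Tokenizer()
input_tokenizer.fit_on_texts(pt_sentences)

# Get maximum length of the sentences, counted in words
pt_length = max([len(sentence.split()) for sentence in pt_sentences])

# Transform text to sequences of numerical indexes
X = input_tokenizer.texts_to_sequences(pt_sentences)

# Pad the shorter sequences with trailing zeros
X = pad_sequences(X, maxlen=pt_length, padding='post')

# Print first sentence and its transformed version
print(pt_sentences[0])
print(X[0])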