Training the model
You will train the previously implemented model in this exercise. Did you know that Google's encoder-decoder based machine translation model took 2-4 days to train?
For this exercise you will be using a small dataset of 1500 sentences (i.e. en_text and fr_text) to train the model. This amount will hardly be enough to see good performance, but the method remains the same: it is simply a matter of training on more data for longer. You have also been provided with the model nmt and the sents2seqs() function that you implemented previously. In this exercise you will reverse the encoder input text, which tends to improve performance. Here, en_x refers to the encoder input, while de_y refers to the decoder output (the training targets).
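As a refresher, here is a minimal sketch of what a function like sents2seqs() might look like. The tokenizers (en_tok, fr_tok), sequence lengths (en_len, fr_len) and vocabulary sizes (en_vocab, fr_vocab) are assumed to exist as globals; these names are illustrative and may not match the course's exact implementation.

from tensorflow.keras.preprocessing.sequence import pad_sequences
from tensorflow.keras.utils import to_categorical

def sents2seqs(sent_type, sentences, onehot=False, pad_type='post', reverse=False):
    # Pick the tokenizer, sequence length and vocabulary size for this side
    # (en_tok, fr_tok, en_len, fr_len, en_vocab, fr_vocab are assumed globals)
    tok = en_tok if sent_type == 'source' else fr_tok
    seq_len = en_len if sent_type == 'source' else fr_len
    vsize = en_vocab if sent_type == 'source' else fr_vocab
    # Convert sentences to padded integer sequences of a fixed length
    seqs = pad_sequences(tok.texts_to_sequences(sentences), padding=pad_type, maxlen=seq_len)
    if reverse:
        # Reverse the word order of every sequence (flip the time axis)
        seqs = seqs[:, ::-1]
    if onehot:
        # Convert word IDs to onehot vectors of size vsize
        seqs = to_categorical(seqs, num_classes=vsize)
    return seqs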
Exercise instructions
- Get a single batch of encoder inputs (English sentences from index i to i+bsize) using the sents2seqs() function. Inputs need to be reversed and onehot encoded.
- Get a single batch of decoder outputs (French sentences from index i to i+bsize) using the sents2seqs() function. Inputs need to be onehot encoded.
- Train the model on a single batch of data containing en_x and de_y (a toy illustration of the two Keras calls involved follows this list).
- Obtain the evaluation metrics for en_x and de_y by evaluating the model with a batch_size of bsize.
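The two model methods used in these steps, train_on_batch() and evaluate(), are standard Keras API calls. The self-contained toy example below, built on a hypothetical two-class model unrelated to nmt, shows what each call does:

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense
import numpy as np

# A tiny stand-in model, only to demonstrate the two calls
toy = Sequential([Dense(2, activation='softmax', input_shape=(4,))])
toy.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['acc'])

x = np.random.random((8, 4))               # 8 fake samples with 4 features
y = np.eye(2)[np.random.randint(0, 2, 8)]  # 8 fake onehot targets

toy.train_on_batch(x, y)                            # one gradient update on this batch
res = toy.evaluate(x, y, batch_size=8, verbose=0)   # [loss, acc], no weight update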
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
n_epochs, bsize = 3, 250
for ei in range(n_epochs):
    for i in range(0, data_size, bsize):
        # Get a single batch of encoder inputs
        en_x = ____('source', ____, onehot=____, reverse=____)
        # Get a single batch of decoder outputs
        de_y = sents2seqs('target', fr_text[____], onehot=____)
        # Train the model on a single batch of data
        nmt.____(____, ____)
        # Obtain the eval metrics for the training data
        res = nmt.____(____, de_y, batch_size=____, verbose=0)
        print("{} => Train Loss:{}, Train Acc: {}".format(ei+1, res[0], res[1]*100.0))