Avoiding local minima

The previous exercise showed how easy it is to get stuck in local minima. The optimization problem was simple, with only one variable, yet gradient descent still failed to reach the global minimum because it had to pass through local minima first. One way to avoid this is to use momentum, which lets the optimizer break through local minima. We will again use the loss function from the previous exercise, which is already defined and available as loss_function().

The graph shows a one-dimensional function with several local minima and one global minimum.

Several optimizers in tensorflow have a momentum parameter, including SGD and RMSprop. You will use RMSprop in this exercise. Note that x_1 and x_2 are initialized to the same value this time. In addition, keras.optimizers.RMSprop() has already been imported for you from tensorflow.

This exercise is part of the course

Introduction to TensorFlow in Python


Exercise instructions

Set the opt_1 operation to use a learning rate of 0.01 and a momentum of 0.99.
Set opt_2 to use the root mean square propagation (RMS) optimizer with a learning rate of 0.01 and a momentum of 0.00.
Define the minimization operation for opt_2.
Print x_1 and x_2 as numpy arrays.

Hands-on interactive exercise

Try this exercise by completing the sample code below.

# Initialize x_1 and x_2
x_1 = Variable(0.05, float32)
x_2 = Variable(0.05, float32)

# Define the optimization operation for opt_1 and opt_2
opt_1 = keras.optimizers.RMSprop(learning_rate=____, momentum=____)
opt_2 = ____

for j in range(100):
    opt_1.minimize(lambda: loss_function(x_1), var_list=[x_1])
    # Define the minimization operation for opt_2
    ____

# Print x_1 and x_2 as numpy arrays
print(____, ____)
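
To make the role of the momentum parameter more concrete, here is a minimal pure-Python sketch of an RMSprop-style update with momentum. It is not the tensorflow implementation and it does not complete the exercise. It assumes the commonly described rule: keep a running average of squared gradients, divide the gradient by its square root, and add the result to a velocity that decays by the momentum factor. The names rmsprop_minimize and example_loss_gradient, the made-up loss 0.05*(x - 3)**2 + sin(4*x), and the defaults rho=0.9 and epsilon=1e-7 are all assumptions for illustration; the exercise's loss_function() is not reproduced here.

```python
import math


def example_loss_gradient(x):
    """Gradient of a made-up loss, 0.05*(x - 3)**2 + sin(4*x).

    Hypothetical stand-in with several local minima; it is NOT the
    loss_function() used in the exercise.
    """
    return 0.1 * (x - 3.0) + 4.0 * math.cos(4.0 * x)


def rmsprop_minimize(gradient, x, learning_rate, momentum, steps,
                     rho=0.9, epsilon=1e-7):
    """Run `steps` RMSprop-style updates on a scalar and return the final x.

    rho and epsilon are assumed defaults chosen for this sketch.
    """
    mean_square = 0.0  # running average of squared gradients
    velocity = 0.0     # momentum accumulator
    for _ in range(steps):
        g = gradient(x)
        mean_square = rho * mean_square + (1.0 - rho) * g * g
        # Scale the step by the root mean square of recent gradients
        increment = learning_rate * g / math.sqrt(mean_square + epsilon)
        # With momentum=0.0 the velocity is just the current increment
        velocity = momentum * velocity + increment
        x = x - velocity
    return x


# Same starting point and learning rate, different momentum
x_1 = rmsprop_minimize(example_loss_gradient, 0.05,
                       learning_rate=0.01, momentum=0.99, steps=100)
x_2 = rmsprop_minimize(example_loss_gradient, 0.05,
                       learning_rate=0.01, momentum=0.00, steps=100)
print(x_1, x_2)
```

With momentum=0.00 each step depends only on the current scaled gradient. With momentum=0.99 earlier steps keep contributing to the velocity, which is what can carry the optimizer past a shallow dip. Which minimum each run ends in depends on the loss and on the starting point.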
