The dangers of local minima
Consider the plot of the following loss function, loss_function(), which contains a global minimum, marked by the dot on the right, and several local minima, including the one marked by the dot on the left.
In this exercise, you will try to find the global minimum of loss_function() using keras.optimizers.SGD(). You will do this twice, each time with a different initial value of the input to loss_function(). First, you will use x_1, a variable with an initial value of 6.0. Second, you will use x_2, a variable with an initial value of 0.3. Note that loss_function() has been defined and is available.
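The exact definition of loss_function() is not shown in this exercise; it is predefined in the workspace. As a rough sketch only, a function with the same character (a shallow local minimum to the left of a deeper global minimum) could look like the quartic below; the coefficients are invented for illustration and are not the course's actual definition.

import tensorflow as tf

# Illustrative stand-in for the predefined loss_function(): a quartic
# with a shallow local minimum near x = 0 and a deeper global minimum
# near x = 3.2 (coefficients invented for this sketch)
def loss_function(x):
    return x**4 - 6.0 * x**3 + 8.0 * x**2 + x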
Exercise instructions
- Set opt to use the stochastic gradient descent optimizer (SGD) with a learning rate of 0.01.
- Perform minimization using the loss function, loss_function(), and the variable with an initial value of 6.0, x_1.
- Perform minimization using the loss function, loss_function(), and the variable with an initial value of 0.3, x_2.
- Print x_1 and x_2 as numpy arrays and check whether the values differ. These are the minima that the algorithm identified.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Imports the sample code relies on (in the exercise workspace these,
# like loss_function(), are already available)
from tensorflow import Variable, float32, keras

# Initialize x_1 and x_2
x_1 = Variable(6.0, float32)
x_2 = Variable(0.3, float32)

# Define the optimization operation
opt = keras.optimizers.____(learning_rate=____)

for j in range(100):
    # Perform minimization using the loss function and x_1
    opt.minimize(lambda: loss_function(____), var_list=[____])
    # Perform minimization using the loss function and x_2
    opt.minimize(lambda: ____, var_list=[____])

# Print x_1 and x_2 as numpy arrays
print(____.numpy(), ____.numpy())
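For reference, here is one way the blanks could be filled in, following the instructions above. This assumes the TensorFlow 2.x Keras optimizer API used in the course, where opt.minimize() accepts a zero-argument loss callable and a var_list of the variables to update.

# One possible completed version of the sample code
opt = keras.optimizers.SGD(learning_rate=0.01)

for j in range(100):
    # Perform minimization using the loss function and x_1
    opt.minimize(lambda: loss_function(x_1), var_list=[x_1])
    # Perform minimization using the loss function and x_2
    opt.minimize(lambda: loss_function(x_2), var_list=[x_2])

# Print x_1 and x_2 as numpy arrays
print(x_1.numpy(), x_2.numpy())

The two printed values will typically differ: the run started at 6.0 descends toward the global minimum on the right, while the run started at 0.3 settles into the nearby local minimum, which is exactly the danger this exercise illustrates.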