Vidnes, Nadja (Master thesis, 2010)
Gradient descent (GD) is a popular approach for solving optimisation problems. A disadvantage
of the method is the di culties with choosing a proper learning rate that sets the step size for
the algorithm. If the learning ...