Gradient descent

Intuition:

Suppose f is differentiable and we move from x to x − η∇f(x) for a small η > 0. A first-order Taylor expansion gives f(x − η∇f(x)) ≈ f(x) − η‖∇f(x)‖², so we reduced the objective value, provided ∇f(x) ≠ 0 and η is small enough.

The Algorithm:

  1. Start at some initial point x_0.
    For all t = 0, 1, 2, … do:
  2. Make a step x_{t+1} = x_t − η_t·d_t:
    • Identify a descent direction d_t (for gradient descent, d_t = ∇f(x_t))
    • Set a step-size η_t > 0
  3. Stop when some criteria hold (e.g. ‖∇f(x_t)‖ ≤ ε).
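The steps above can be sketched as follows; this is a minimal illustration, where the example function, the fixed step size, and the stopping tolerance are all assumptions made for the sketch:

```python
import numpy as np

def gradient_descent(grad, x0, step_size, tol=1e-8, max_iters=10_000):
    """Minimal gradient descent: repeat x <- x - eta * grad(x) until the
    gradient norm falls below tol (the stopping criterion of step 3)."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iters):
        g = grad(x)                      # descent direction: the gradient
        if np.linalg.norm(g) <= tol:     # stop when the criterion holds
            break
        x = x - step_size * g            # make a step
    return x

# Example: f(x) = 0.5 * ||x||^2, whose gradient is x; the minimizer is 0.
x_min = gradient_descent(grad=lambda x: x, x0=[3.0, -4.0], step_size=0.5)
```

Here each step halves the iterate, so after a few dozen iterations the gradient norm drops below the tolerance and the loop exits near the minimizer 0.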

Usual assumptions in the literature are that f is β-smooth, i.e. ‖∇f(x) − ∇f(y)‖ ≤ β‖x − y‖ for all x, y, and α-strongly convex, i.e. f(y) ≥ f(x) + ⟨∇f(x), y − x⟩ + (α/2)‖y − x‖² for all x, y.

Now we analyse gradient descent with the update x_{t+1} = x_t − (1/β)·∇f(x_t).

Theorem:

For the above version of gradient descent, if f is β-smooth and α-strongly convex with minimizer x*, then for all t ≥ 0:

  ‖x_t − x*‖² ≤ (1 − α/β)^t · ‖x_0 − x*‖².

Proof:

Expanding the update with η = 1/β,

  ‖x_{t+1} − x*‖² = ‖x_t − x*‖² − 2η⟨∇f(x_t), x_t − x*⟩ + η²‖∇f(x_t)‖².

By α-strong convexity, ⟨∇f(x_t), x_t − x*⟩ ≥ f(x_t) − f(x*) + (α/2)‖x_t − x*‖², and by β-smoothness, ‖∇f(x_t)‖² ≤ 2β(f(x_t) − f(x*)). Substituting both bounds,

  ‖x_{t+1} − x*‖² ≤ (1 − α/β)‖x_t − x*‖² + (2βη² − 2η)(f(x_t) − f(x*)) = (1 − α/β)‖x_t − x*‖²,

since 2βη² − 2η = 0 for η = 1/β. Using induction, we conclude the result.
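The contraction in the theorem can be checked numerically on a small quadratic; the particular f below (with Hessian diag(2, 10), so α = 2 and β = 10) is an assumption chosen for the check:

```python
import numpy as np

# Verify ||x_t - x*||^2 <= (1 - alpha/beta)^t * ||x_0 - x*||^2 on
# f(x) = 0.5 * x^T diag(2, 10) x, which is alpha=2 strongly convex
# and beta=10 smooth; its minimizer is x* = 0.
alpha, beta = 2.0, 10.0
H = np.diag([alpha, beta])
x = np.array([1.0, 1.0])
x0_dist_sq = np.dot(x, x)

for t in range(1, 51):
    x = x - (1.0 / beta) * (H @ x)        # x_{t+1} = x_t - (1/beta) * grad f(x_t)
    bound = (1 - alpha / beta) ** t * x0_dist_sq
    assert np.dot(x, x) <= bound + 1e-12  # the theorem's bound holds at step t
```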

The quantity α/β ∈ (0, 1] is called the “condition number” of f.

Example:

Take f(x) = (1/2)·(x_1² + 0.01·x_2²). Its Hessian is diag(1, 0.01), so f is β-smooth with β = 1 and α-strongly convex with α = 0.01.

The condition number of f is now α/β = 0.01, which is slow: the contraction factor per step is 1 − 0.01 = 0.99.
To make it faster we can do a change of variables y = (x_1, 0.1·x_2), so that in the new coordinates f becomes (1/2)·(y_1² + y_2²).
Then the condition number is 1 and gradient descent with step 1/β reaches the minimizer in a single step.
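The effect of the change of variables can be seen directly by running gradient descent on the quadratic before and after rescaling; the specific function f(x) = (1/2)(x_1² + 0.01·x_2²) and the step count are assumptions for this sketch:

```python
import numpy as np

def run_gd(hessian_diag, x0, num_steps):
    """Gradient descent with step 1/beta on f(x) = 0.5 * sum(d_i * x_i^2)."""
    d = np.asarray(hessian_diag, dtype=float)
    beta = d.max()                       # smoothness constant of this f
    x = np.asarray(x0, dtype=float)
    for _ in range(num_steps):
        x = x - (1.0 / beta) * (d * x)   # gradient of f is d * x, componentwise
    return x

x0 = np.array([1.0, 1.0])
x_slow = run_gd([1.0, 0.01], x0, num_steps=100)  # condition number 0.01
x_fast = run_gd([1.0, 1.0], x0, num_steps=100)   # after rescaling: condition number 1
# The ill-conditioned run shrinks the flat coordinate by only 0.99 per step,
# while the well-conditioned run jumps straight to the minimizer.
```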