Make it better. Make it better again. Make it better with derivatives. Make it better with acceleration. Make it better even though things are nonconvex because I’m using a deep network and things are stochastic because I’m using minibatching. Stuff like that.

You're gonna see topics like:

list of papers