J. Werfel, X. Xie, and H. Seung. In, MIT Press, (2003)Discussion of learning curves for stochastic gradient descent.
Besides gradient based approaches, the paper shortly describes (with additional references) weight perturbation and node perturbation approaches..