J. Werfel, X. Xie, and H. Seung. In, MIT Press, (2003)Discussion of learning curves for stochastic gradient descent.
Besides gradient based approaches, the paper shortly describes (with additional references) weight perturbation and node perturbation approaches..
K. Bollacker, C. Evans, P. Paritosh, T. Sturge, and J. Taylor. SIGMOD '08: Proceedings of the 2008 ACM SIGMOD international conference on Management of data, page 1247--1250. New York, NY, USA, ACM, (2008)
M. Smucker, J. Allan, and B. Carterette. CIKM '07: Proceedings of the sixteenth ACM conference on Conference on information and knowledge management, page 623--632. New York, NY, USA, ACM, (2007)
P. Scheir, P. Hofmair, M. Granitzer, and S. Lindstaedt. Semantic Systems From Vision to Applications - Proceedings of the
SEMANTICS 2006, Vienna, Austria, November 28-30, 2006, 291-301, Österreichische
Computer Gesellschaft, Wien, (2006)
V. Sabol, M. Granitzer, K. Tochtermann, and W. Sarka. Proc. 2nd European Workshop on the (Ref. No. 2005/11099) Integration
of Knowledge, Semantics and Digital Media Technology EWIMT 2005, page 349--355. (2005)
A. Rath, M. Kröll, S. Lindstaedt, and M. Granitzer. Proc. of the 4th Conference on Professional Knowledge Management, volume 2 of ProKW2007 Productive Knowledge Work - Management and Technological
Challenges, GITO Gmbh, Berlin, (March 2007)