Author of the publication

Training Deep and Recurrent Networks with Hessian-Free Optimization.

, and . Neural Networks: Tricks of the Trade (2nd ed.), volume 7700 of Lecture Notes in Computer Science, Springer, (2012)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

On the Expressive Efficiency of Sum Product Networks., and . CoRR, (2014)Kronecker-factored Curvature Approximations for Recurrent Neural Networks., , and . ICLR (Poster), OpenReview.net, (2018)Optimizing Neural Networks with Kronecker-factored Approximate Curvature., and . ICML, volume 37 of JMLR Workshop and Conference Proceedings, page 2408-2417. JMLR.org, (2015)Disentangling the Causes of Plasticity Loss in Neural Networks., , , , , , and . CoRR, (2024)Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation., , , , , , and . ICLR, OpenReview.net, (2023)Blockchain-based Verifiable Credential Sharing with Selective Disclosure., , , , and . TrustCom, page 959-966. IEEE, (2020)New perspectives on the natural gradient method.. CoRR, (2014)A Kronecker-factored approximate Fisher matrix for convolution layers., and . ICML, volume 48 of JMLR Workshop and Conference Proceedings, page 573-582. JMLR.org, (2016)Rapid training of deep neural networks without skip connections or normalization layers using Deep Kernel Shaping., , , , , , and . CoRR, (2021)On the validity of kernel approximations for orthogonally-initialized neural networks.. CoRR, (2021)