Author of the publication

Dynamical Isometry and a Mean Field Theory of RNNs: Gating Enables Signal Propagation in Recurrent Neural Networks.

, , and . ICML, volume 80 of Proceedings of Machine Learning Research, page 872-881. PMLR, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Clinical Data Research Network Lessons Learned., , , , , , , , and . CRI, AMIA, (2016)Deep Neural Networks as Gaussian Processes., , , , , and . ICLR (Poster), OpenReview.net, (2018)Nonlinear random matrix theory for deep learning., and . NIPS, page 2637-2646. (2017)Second-order regression models exhibit progressive sharpening to the edge of stability., , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 169-195. PMLR, (2023)A Random Matrix Perspective on Mixtures of Nonlinearities in High Dimensions., , and . AISTATS, volume 151 of Proceedings of Machine Learning Research, page 3434-3457. PMLR, (2022)Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models., , , , , , , , , and 31 other author(s). Trans. Mach. Learn. Res., (2024)Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability., , , , , , , , , and 21 other author(s). CoRR, (2024)KAMA-NNs: Low-dimensional Rotation Based Neural Networks., , , and . AISTATS, volume 89 of Proceedings of Machine Learning Research, page 236-245. PMLR, (2019)A Random Matrix Perspective on Mixtures of Nonlinearities for Deep Learning., , and . CoRR, (2019)Provable Benefit of Orthogonal Initialization in Optimizing Deep Linear Networks., , and . ICLR, OpenReview.net, (2020)