Author of the publication

Dynamical Isometry and a Mean Field Theory of CNNs: How to Train 10, 000-Layer Vanilla Convolutional Neural Networks.

, , , , and . ICML, volume 80 of Proceedings of Machine Learning Research, page 5389-5398. PMLR, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Quantum Many-Body Physics Calculations with Large Language Models., , , , , , , and . CoRR, (2024)Bayesian Deep Convolutional Networks with Many Channels are Gaussian Processes., , , , , , , , and . ICLR (Poster), OpenReview.net, (2019)The large learning rate phase of deep learning: the catapult mechanism, , , , and . arXiv preprint arXiv:2003.02218, (2020)Deep Neural Networks as Gaussian Processes., , , , , and . ICLR (Poster), OpenReview.net, (2018)Infinite attention: NNGP and NTK for deep attention networks., , , and . ICML, volume 119 of Proceedings of Machine Learning Research, page 4376-4386. PMLR, (2020)Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models., , , , , , , , , and 440 other author(s). CoRR, (2022)The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning., , , and . CoRR, (2021)Geometry of Neural Network Loss Surfaces via Random Matrix Theory., and . ICML, volume 70 of Proceedings of Machine Learning Research, page 2798-2806. PMLR, (2017)Bayesian Deep Convolutional Networks with Many Channels are Gaussian Processes, , , , , , , , and . (2018)cite arxiv:1810.05148Comment: Published as a conference paper at ICLR 2019.Statistical Mechanics of Deep Learning, , , , , and . Annual Review of Condensed Matter Physics, 11 (1): null (2020)