Author of the publication

Three Factors Influencing Minima in SGD

, , , , , , and . (2017)cite arxiv:1711.04623Comment: First two authors contributed equally. Short version accepted into ICLR workshop.

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

On the Relation Between the Sharpest Directions of DNN Loss and the SGD Step Length., , , , , and . ICLR (Poster), OpenReview.net, (2019)Algorithms for Estimating the Partition Function of Restricted Boltzmann Machines (Extended Abstract)., , and . IJCAI, page 5045-5049. ijcai.org, (2020)Journal track.Bringing Light Into the Dark: A Large-scale Evaluation of Knowledge Graph Embedding Models Under a Unified Framework, , , , , , , , and . (2020)cite arxiv:2006.13365.Mimicking the radiologists' workflow: Estimating pediatric hand bone age with stacked deep neural networks., , , , , and . Medical Image Anal., (2020)Bidirectional Helmholtz Machines., , , and . ICML, volume 48 of JMLR Workshop and Conference Proceedings, page 2511-2519. JMLR.org, (2016)How Sampling Impacts the Robustness of Stochastic Neural Networks., and . NeurIPS, (2022)Uncertainty-weighted Loss Functions for Improved Adversarial Attacks on Semantic Segmentation., and . CoRR, (2023)Wasserstein dropout., , , , , and . Mach. Learn., 113 (5): 3161-3204 (May 2024)An objective function for STDP., , , , and . CoRR, (2015)Three Factors Influencing Minima in SGD, , , , , , and . (2017)cite arxiv:1711.04623Comment: First two authors contributed equally. Short version accepted into ICLR workshop.