Sharp Minima Can Generalize For Deep Nets

(2017). arXiv:1703.04933. Comment: 8.5 pages of main content, 2.5 of bibliography and 1 page of appendix.


Other publications of authors with the same name

The Expected Performance Curve. In Proceedings of the 22nd International Conference on Machine Learning, pages 9-16. (2005)
Order Matters: Sequence to sequence for sets. (2015). arXiv:1511.06391. Comment: Accepted as a conference paper at ICLR 2015.
Extracting information from multimedia meeting collections. Multimedia Information Retrieval, pages 245-252. ACM, (2005)
Area Attention. ICML, volume 97 of Proceedings of Machine Learning Research, pages 3846-3855. PMLR, (2019)
A Discriminative Approach for the Retrieval of Images from Text Queries. ECML, volume 4212 of Lecture Notes in Computer Science, pages 162-173. Springer, (2006)
Estimating the Confidence Interval of Expected Performance Curve in Biometric Authentication using Joint Bootstrap. ICASSP (2), pages 137-140. IEEE, (2007)
Modeling human interaction in meetings. ICASSP (4), pages 748-751. IEEE, (2003)
Client Dependent GMM-SVM Models for Speaker Verification. ICANN, volume 2714 of Lecture Notes in Computer Science, pages 443-451. Springer, (2003)
When can transformers reason with abstract symbols? CoRR, (2023)
Learning a Synaptic Learning Rule. 751. Département d'Informatique et de Recherche Opérationnelle, Université de Montréal, Montreal, Canada, (1990)