Author of the publication

What Will it Take to Fix Benchmarking in Natural Language Understanding?

NAACL-HLT, pages 4843-4855. Association for Computational Linguistics, (2021)


Other publications of authors with the same name

Large scale distributed neural network training through online distillation. (2018). arXiv:1804.03235.

Training Restricted Boltzmann Machines on Word Observations. ICML, icml.cc / Omnipress, (2012)

Pre-training helps Bayesian optimization too. CoRR, (2022)

What Will it Take to Fix Benchmarking in Natural Language Understanding? NAACL-HLT, pages 4843-4855. Association for Computational Linguistics, (2021)

Deep Belief Networks using discriminative features for phone recognition. ICASSP, pages 5060-5063. IEEE, (2011)

The Importance of Generation Order in Language Modeling. EMNLP, pages 2942-2946. Association for Computational Linguistics, (2018)

On the importance of initialization and momentum in deep learning. ICML (3), volume 28 of JMLR Workshop and Conference Proceedings, pages 1139-1147. JMLR.org, (2013)

Improvements to Deep Convolutional Neural Networks for LVCSR. ASRU, pages 315-320. IEEE, (2013)

Incorporating Side Information in Probabilistic Matrix Factorization with Gaussian Processes. UAI, pages 1-9. AUAI Press, (2010)

Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model. NeurIPS, pages 8194-8205. (2019)