Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Large scale distributed neural network training through online distillation, , , , , and . (2018)cite arxiv:1804.03235.Deep Belief Networks using discriminative features for phone recognition., , , , , and . ICASSP, page 5060-5063. IEEE, (2011)Pre-training helps Bayesian optimization too., , , , , , , , and . CoRR, (2022)What Will it Take to Fix Benchmarking in Natural Language Understanding?, and . NAACL-HLT, page 4843-4855. Association for Computational Linguistics, (2021)Training Restricted Boltzmann Machines on Word Observations., , and . ICML, icml.cc / Omnipress, (2012)The Importance of Generation Order in Language Modeling., , , and . EMNLP, page 2942-2946. Association for Computational Linguistics, (2018)Improvements to Deep Convolutional Neural Networks for LVCSR., , , , , , , , and . ASRU, page 315-320. IEEE, (2013)Multi-task Neural Networks for QSAR Predictions., , and . CoRR, (2014)Incorporating Side Information in Probabilistic Matrix Factorization with Gaussian Processes., , and . UAI, page 1-9. AUAI Press, (2010)Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model., , , , , , , and . NeurIPS, page 8194-8205. (2019)