Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling (and 81 other author(s)). (2019). cite arxiv:1902.08295.
TF-Ranking: Scalable TensorFlow Library for Learning-to-Rank. KDD, pages 2970-2978. ACM, (2019).
Measuring and Harnessing Transference in Multi-Task Learning. CoRR, (2020).
Efficiently Identifying Task Groupings for Multi-Task Learning. CoRR, (2021).
A Computationally Efficient Sparsified Online Newton Method. CoRR, (2023).
Sketchy: Memory-efficient Adaptive Regularization with Frequent Directions. CoRR, (2023).
Large-Scale Differentially Private BERT. EMNLP (Findings), pages 6481-6491. Association for Computational Linguistics, (2022).
Knowledge distillation: A good teacher is patient and consistent. CVPR, pages 10915-10924. IEEE, (2022).
Stochastic Optimization with Laggard Data Pipelines. NeurIPS, (2020).
Large scale distributed neural network training through online distillation. (2018). cite arxiv:1804.03235.