Author of the publication

Cloze-driven Pretraining of Self-attention Networks.

EMNLP/IJCNLP (1), pages 5359-5368. Association for Computational Linguistics, (2019)

Other publications of authors with the same name

Llama 2: Open Foundation and Fine-Tuned Chat Models. (Jul 18, 2023)

Dense Passage Retrieval for Open-Domain Question Answering. (2020). arXiv:2004.04906. Comment: EMNLP 2020.

Facebook FAIR's WMT19 News Translation Task Submission. WMT (2), pages 314-319. Association for Computational Linguistics, (2019)

Facebook AI's WMT21 News Translation Task Submission. WMT@EMNLP, pages 205-215. Association for Computational Linguistics, (2021)

Cloze-driven Pretraining of Self-attention Networks. EMNLP/IJCNLP (1), pages 5359-5368. Association for Computational Linguistics, (2019)

Training ASR Models By Generation of Contextual Information. ICASSP, pages 7864-7868. IEEE, (2020)

fairseq: A Fast, Extensible Toolkit for Sequence Modeling. NAACL-HLT (Demonstrations), pages 48-53. Association for Computational Linguistics, (2019)

Classical Structured Prediction Losses for Sequence to Sequence Learning. NAACL-HLT, pages 355-364. Association for Computational Linguistics, (2018)

Understanding Back-Translation at Scale. EMNLP, pages 489-500. Association for Computational Linguistics, (2018)

Beyond English-Centric Multilingual Machine Translation. J. Mach. Learn. Res., (2021)