Author of the publication

BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation.

, , and . EMNLP (1), page 6663-6675. Association for Computational Linguistics, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

DSmith: Compiler Fuzzing through Generative Deep Learning Model with Attention., , , , and . IJCNN, page 1-9. IEEE, (2020)A method for obtaining maize phenotypic parameters based on improved QuickShift algorithm., , , , , and . Comput. Electron. Agric., (November 2023)Copy-and-Patch Binary Code Generation., and . CoRR, (2020)BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation., , and . EMNLP (1), page 6663-6675. Association for Computational Linguistics, (2021)Poster: DeePTOP: Personalized Tachycardia Onset Prediction Using Bi-directional LSTM in Wearable Embedded Systems., , , , , , , , , and . EWSN, page 216-217. ACM, (2019)Deegen: A Meta-compiler Approach for High Performance VMs at Low Engineering Cost (Invited Talk).. ICOOOLPS@ECOOP, page 1. ACM, (2023)PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning., , , , , and . CoRR, (2023)Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity., , , , and . EMNLP (Findings), page 12858-12870. Association for Computational Linguistics, (2023)Hierarchical Adaptive Value Estimation for Multi-modal Visual Reinforcement Learning., , , , , and . NeurIPS, (2023)Porosity Prediction Based on Ensemble Learning for Feature Selection and an Optimized GRU Improved by the PSO Algorithm., , , , , and . Int. J. Comput. Intell. Syst., 17 (1): 189 (December 2024)