Author of the publication

bert2BERT: Towards Reusable Pretrained Language Models.

, , , , , , , , , and . ACL (1), page 2134-2148. Association for Computational Linguistics, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Temporal Exemplar-Based Bayesian Networks for Facial Expression Recognition., and . ICMLA, page 16-22. IEEE Computer Society, (2008)On Estimating Variances for Topic Set Size Design., and . EVIA@NTCIR, National Institute of Informatics (NII), (2016)Binary Image Thinning Using Autowaves Generated by PCNN., , and . Neural Process. Lett., 25 (1): 49-62 (2007)Rigid medical image registration using PCA neural network., , and . Neurocomputing, 69 (13-15): 1717-1722 (2006)Noninvasive Self-attention for Side Information Fusion in Sequential Recommendation., , , , , and . AAAI, page 4249-4256. AAAI Press, (2021)Test Collections and Measures for Evaluating Customer-Helpdesk Dialogues., , , , and . EVIA@NTCIR, volume 2008 of CEUR Workshop Proceedings, page 1-9. CEUR-WS.org, (2017)DynaBERT: Dynamic BERT with Adaptive Width and Depth., , , , , and . NeurIPS, (2020)Improved OOD Generalization via Adversarial Training and Pretraing., , , , , , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 11987-11997. PMLR, (2021)Retrieval-free Knowledge Injection through Multi-Document Traversal for Dialogue Models., , , , , , , , , and . ACL (1), page 6608-6619. Association for Computational Linguistics, (2023)AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models., , , , , and . ACL/IJCNLP (1), page 5146-5157. Association for Computational Linguistics, (2021)