Author of the publication

Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains.

, , , , , and . CoRR, (2020)

Other publications of authors with the same name

Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains., , , , , and . CoRR, (2020)

M6: Multi-Modality-to-Multi-Modality Multitask Mega-transformer for Unified Pretraining., , , , , , , , and . KDD, page 3251-3261. ACM, (2021)

Graph-based Multi-hop Reasoning for Long Text Generation., , , , , and . CoRR, (2020)

Towards Knowledge-Based Recommender Dialog System., , , , , , and . EMNLP/IJCNLP (1), page 1803-1813. Association for Computational Linguistics, (2019)

Towards Knowledge-Based Personalized Product Description Generation in E-commerce., , , , , and . KDD, page 3040-3050. ACM, (2019)

Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation., , , , , , and . ACL/IJCNLP (Findings), volume ACL/IJCNLP 2021 of Findings of ACL, page 4831-4843. Association for Computational Linguistics, (2021)

PRIS at Knowledge Base Population 2013., , , , , , , , , and . TAC, NIST, (2013)

OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models., , , , , , , , , and 8 other author(s). CoRR, (2022)

M6: A Chinese Multimodal Pretrainer., , , , , , , , , and 15 other author(s). CoRR, (2021)

Qwen Technical Report., , , , , , , , , and 38 other author(s). CoRR, (2023)