Author of the publication

Mask Attention Networks: Rethinking and Strengthen Transformer.

, , , , , , , , and . NAACL-HLT, page 1692-1701. Association for Computational Linguistics, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

TwinBERT: Distilling Knowledge to Twin-Structured BERT Models for Efficient Retrieval., , and . CoRR, (2020)Healing Unsafe Dialogue Responses with Weak Supervision Signals., , , , , , and . CoRR, (2023)TwinBERT: Distilling Knowledge to Twin-Structured Compressed BERT Models for Large-Scale Retrieval., , and . CIKM, page 2645-2652. ACM, (2020)The impact of visual appearance on user response in online display advertising., , , , , and . WWW (Companion Volume), page 457-458. ACM, (2012)Mask Attention Networks: Rethinking and Strengthen Transformer., , , , , , , , and . NAACL-HLT, page 1692-1701. Association for Computational Linguistics, (2021)MERGE: Fast Private Text Generation., , , , , , , and . AAAI, page 19884-19892. AAAI Press, (2024)A Probabilistic Semantic Model for Image Annotation and Multi-Modal Image Retrieva., , , , and . ICCV, page 846-851. IEEE Computer Society, (2005)A data mining approach to modeling relationships among categories in image collection., , and . KDD, page 749-754. ACM, (2004)BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining., , , , , , , , , and 2 other author(s). ICML, volume 139 of Proceedings of Machine Learning Research, page 8630-8639. PMLR, (2021)GLGE: A New General Language Generation Evaluation Benchmark., , , , , , , , , and 8 other author(s). ACL/IJCNLP (Findings), volume ACL/IJCNLP 2021 of Findings of ACL, page 408-420. Association for Computational Linguistics, (2021)