Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training., , , , , and . CoRR, (2022)ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding., , , , , , , , , and 5 other author(s). EMNLP (Findings), page 3744-3756. Association for Computational Linguistics, (2022)ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding., , , , , , , , , and 5 other author(s). CoRR, (2022)A Novel Multi-view Object Class Detection Framework for Document Image Content Analysis., , and . ICDAR, page 1095-1099. IEEE Computer Society, (2013)ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation., , , , , , , , , and . CoRR, (2021)mmLayout: Multi-grained MultiModal Transformer for Document Understanding., , , , , , , , , and 1 other author(s). ACM Multimedia, page 4877-4886. ACM, (2022)ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts., , , , , , , , , and 5 other author(s). CVPR, page 10135-10145. IEEE, (2023)ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation., , , , , , , and . CoRR, (2022)ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding., , , , , , , , , and 1 other author(s). CoRR, (2022)ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph., , , , , , and . CoRR, (2020)