Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to that person.

 

Other publications of authors with the same name

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models. CoRR, (2021)
VT-SSum: A Benchmark Dataset for Video Transcript Segmentation and Summarization. CoRR, (2021)
Language Is Not All You Need: Aligning Perception with Language Models. CoRR, (2023)
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding. CoRR, (2020)
DiT: Self-supervised Pre-training for Document Image Transformer. ACM Multimedia, page 3530-3539. ACM, (2022)
XDoc: Unified Pre-training for Cross-Format Document Understanding. EMNLP (Findings), page 1006-1016. Association for Computational Linguistics, (2022)
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering. CoRR, (2023)
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking. ACM Multimedia, page 4083-4091. ACM, (2022)
TrOCR: Transformer-Based Optical Character Recognition with Pre-trained Models. AAAI, page 13094-13102. AAAI Press, (2023)
LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding. ACL/IJCNLP (1), page 2579-2591. Association for Computational Linguistics, (2021)