Author of the publication

LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding.

Y. Xu, Y. Xu, T. Lv, L. Cui, F. Wei, G. Wang, Y. Lu, D. Florêncio, C. Zhang, W. Che, M. Zhang, and L. Zhou. CoRR, (2020)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some publications already assigned to that person.

Ivan Lvov

Zhiyi Lv

Guohua Lv

Zuopeng LV

Li-Ping Lv

Other publications of authors with the same name

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models. M. Li, T. Lv, L. Cui, Y. Lu, D. Florêncio, C. Zhang, Z. Li, and F. Wei. CoRR, (2021)

VT-SSum: A Benchmark Dataset for Video Transcript Segmentation and Summarization. T. Lv, L. Cui, M. Vasilijevic, and F. Wei. CoRR, (2021)

Language Is Not All You Need: Aligning Perception with Language Models. S. Huang, L. Dong, W. Wang, Y. Hao, S. Singhal, S. Ma, T. Lv, L. Cui, O. Mohammed, B. Patra and 8 other author(s). CoRR, (2023)

LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding. Y. Xu, Y. Xu, T. Lv, L. Cui, F. Wei, G. Wang, Y. Lu, D. Florêncio, C. Zhang, W. Che and 2 other author(s). CoRR, (2020)

DiT: Self-supervised Pre-training for Document Image Transformer. J. Li, Y. Xu, T. Lv, L. Cui, C. Zhang, and F. Wei. ACM Multimedia, page 3530-3539. ACM, (2022)

XDoc: Unified Pre-training for Cross-Format Document Understanding. J. Chen, T. Lv, L. Cui, C. Zhang, and F. Wei. EMNLP (Findings), page 1006-1016. Association for Computational Linguistics, (2022)

TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering. J. Chen, Y. Huang, T. Lv, L. Cui, Q. Chen, and F. Wei. CoRR, (2023)

LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking. Y. Huang, T. Lv, L. Cui, Y. Lu, and F. Wei. ACM Multimedia, page 4083-4091. ACM, (2022)

TrOCR: Transformer-Based Optical Character Recognition with Pre-trained Models. M. Li, T. Lv, J. Chen, L. Cui, Y. Lu, D. Florêncio, C. Zhang, Z. Li, and F. Wei. AAAI, page 13094-13102. AAAI Press, (2023)

LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding. Y. Xu, Y. Xu, T. Lv, L. Cui, F. Wei, G. Wang, Y. Lu, D. Florêncio, C. Zhang, W. Che and 2 other author(s). ACL/IJCNLP (1), page 2579-2591. Association for Computational Linguistics, (2021)

BibSonomy

Disambiguation of "Lv, Tengchao"
