Author of the publication

LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding.

ACL/IJCNLP (1), pages 2579-2591. Association for Computational Linguistics, (2021)

Other publications of authors with the same name

Sketch-based 3D model retrieval utilizing adaptive view clustering and semantic information. Multimedia Tools Appl., 76 (24): 26603-26631 (2017)
SHREC'13 Track: Retrieval of Objects Captured with Low-Cost Depth-Sensing Cameras. 3DOR, pages 65-71. Eurographics Association, (2013)
3D sketch-based 3D model retrieval with convolutional neural network. ICPR, pages 2936-2941. IEEE, (2016)
FANet: Quality-Aware Feature Aggregation Network for RGB-T Tracking. CoRR, (2018)
Binary SIFT: towards efficient feature matching verification for image search. ICIMCS, pages 1-6. ACM, (2012)
Inferring Chord Sequence Meanings via Lyrics: Process and Evaluation. ISMIR, pages 463-468. FEUP Edições, (2012)
Human movement summarization and depiction from videos. ICME, pages 1-6. IEEE Computer Society, (2013)
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models. CoRR, (2021)
Personalization in Multimedia Retrieval: A Survey. Multimedia Tools and Applications, 51 (1): 247-277 (January 2011)
Spatial coding for large scale partial-duplicate web image search. ACM Multimedia, pages 511-520. ACM, (2010)