Author of the publication

LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding.

, , , , , , , , , , , and . ACL/IJCNLP (1), page 2579-2591. Association for Computational Linguistics, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Nonuniform sampling of image-based rendering data with the position-interval-error (PIE) function., and . VCIP, volume 5150 of Proceedings of SPIE, page 1347-1358. SPIE, (2003)LeGR: Filter Pruning via Learned Global Ranking., , , and . CoRR, (2019)Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization., , , and . IEEE Trans. Speech Audio Process., 18 (7): 1781-1792 (2010)Geometrically Constrained Room Modeling With Compact Microphone Arrays., , , and . IEEE Trans. Speech Audio Process., 20 (5): 1449-1460 (2012)Smart rebinning for compression of concentric mosaics., , , and . ACM Multimedia, page 201-209. ACM, (2000)Rate-Constrained 3D Surface Estimation From Noise-Corrupted Multiview Depth Videos., , , , , and . IEEE Trans. Image Processing, 23 (7): 3138-3151 (2014)Efficient feature extraction for 2D/3D objects in mesh representation., and . ICIP (3), page 935-938. IEEE, (2001)Automatic speech emotion recognition using recurrent neural networks with local attention., , and . ICASSP, page 2227-2231. IEEE, (2017)Towards optimal least square filters using the eigenfilter approach., and . ICASSP, page 4171. IEEE, (2002)A Simple yet Effective Learnable Positional Encoding Method for Improving Document Transformer Model., , , , , and . AACL/IJCNLP (Findings), page 453-463. Association for Computational Linguistics, (2022)