Author of the publication

DiT: Self-supervised Pre-training for Document Image Transformer.

ACM Multimedia, pages 3530-3539. ACM, (2022)

Other publications of authors with the same name

Smart rebinning for compression of concentric mosaics. ACM Multimedia, pages 201-209. ACM, (2000)
LeGR: Filter Pruning via Learned Global Ranking. CoRR, (2019)
Efficient feature extraction for 2D/3D objects in mesh representation. ICIP (3), pages 935-938. IEEE, (2001)
Towards optimal least square filters using the eigenfilter approach. ICASSP, page 4171. IEEE, (2002)
Automatic speech emotion recognition using recurrent neural networks with local attention. ICASSP, pages 2227-2231. IEEE, (2017)
Nonuniform sampling of image-based rendering data with the position-interval-error (PIE) function. VCIP, volume 5150 of Proceedings of SPIE, pages 1347-1358. SPIE, (2003)
A Simple yet Effective Learnable Positional Encoding Method for Improving Document Transformer Model. AACL/IJCNLP (Findings), pages 453-463. Association for Computational Linguistics, (2022)
Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization. IEEE Trans. Speech Audio Process., 18 (7): 1781-1792 (2010)
Geometrically Constrained Room Modeling With Compact Microphone Arrays. IEEE Trans. Speech Audio Process., 20 (5): 1449-1460 (2012)
A self-reconfigurable camera array. SIGGRAPH Sketches, page 151. ACM, (2004)