Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Truly Multi-modal YouTube-8M Video Classification with Video, Audio, and Text., , , , , , , , , and 6 other author(s). CoRR, (2017)Activity Recognition in Egocentric Life-Logging Videos., , , , , and . ACCV Workshops (3), volume 9010 of Lecture Notes in Computer Science, page 445-458. Springer, (2014)Modeling Entities as Semantic Points for Visual Information Extraction in the Wild., , , , , , , and . CVPR, page 15358-15367. IEEE, (2023)Deep Adaptive Temporal Pooling for Activity Recognition., , , and . ACM Multimedia, page 1829-1837. ACM, (2018)ICDAR 2023 Competition on Born Digital Video Text Question Answering., , , , , , , and . ICDAR (2), volume 14188 of Lecture Notes in Computer Science, page 508-521. Springer, (2023)Multimodal Multi-Stream Deep Learning for Egocentric Activity Recognition., , , , , , , and . CVPR Workshops, page 378-385. IEEE Computer Society, (2016)OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition., , , , , , , , and . CoRR, (2024)On classification of distorted images with deep convolutional neural networks., , and . ICASSP, page 1213-1217. IEEE, (2017)Egocentric activity recognition with multimodal fisher vector., , , , and . ICASSP, page 2717-2721. IEEE, (2016)Vision-Language Pre-Training for Boosting Scene Text Detectors., , , , , , and . CVPR, page 15660-15670. IEEE, (2022)