Author of the publication

Video Dialog via Progressive Inference and Cross-Transformer.

, , , , , and . EMNLP/IJCNLP (1), page 2109-2118. Association for Computational Linguistics, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Video Relation Detection with Spatio-Temporal Graph., , , , , and . ACM Multimedia, page 84-93. ACM, (2019)Video Question Answering via Knowledge-based Progressive Spatial-Temporal Attention Network., , , , , and . TOMM, 15 (2s): 52:1-52:22 (2019)Fast view-based 3D model retrieval via unsupervised multiple feature fusion and online projection learning., , , and . Signal Process., (2016)Visual Verification of Historical Chinese Calligraphy Works., and . MMM (1), volume 4351 of Lecture Notes in Computer Science, page 354-363. Springer, (2007)CMSOF: a structured data organization framework for scanned Chinese medicine books in digital libraries., , , , and . J. Zhejiang Univ. Sci. C, 11 (11): 882-892 (2010)Mining Spatial-Temporal Patterns and Structural Sparsity for Human Motion Data Denoising., , , , , , and . IEEE Trans. Cybern., 45 (12): 2693-2706 (2015)Content-based video retrieval integrating human perception., , and . Storage and Retrieval for Media Databases, volume 4315 of SPIE Proceedings, page 562-570. SPIE, (2001)Popular music retrieval by detecting mood., , and . SIGIR, page 375-376. ACM, (2003)User Preference Learning for Online Social Recommendation., , , , and . IEEE Trans. Knowl. Data Eng., 28 (9): 2522-2534 (2016)Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models., , , , , and . CoRR, (2023)