Author of the publication

Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training.

ACM Multimedia, page 7070-7074. ACM, (2022)

Other publications of authors with the same name

Contextual and selective attention networks for image captioning. Sci. China Inf. Sci., (2022)

SPE-Net: Boosting Point Cloud Analysis via Rotation Robustness Enhancement. ECCV (3), volume 13663 of Lecture Notes in Computer Science, page 593-609. Springer, (2022)

Out-of-Distribution Detection via Conditional Kernel Independence Model. NeurIPS, (2022)

VireoJD-MM @ TRECVid 2019: Activities in Extended Video (ActEV). TRECVID, National Institute of Standards and Technology (NIST), (2019)

Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training. ACM Multimedia, page 7070-7074. ACM, (2022)

Comprehending and Ordering Semantics for Image Captioning. CVPR, page 17969-17978. IEEE, (2022)

Modality-Agnostic Debiasing for Single Domain Generalization. CVPR, page 24142-24151. IEEE, (2023)

Representing Videos As Discriminative Sub-Graphs for Action Recognition. CVPR, page 3310-3319. Computer Vision Foundation / IEEE, (2021)

Exploring Category-Agnostic Clusters for Open-Set Domain Adaptation. CVPR, page 13864-13872. Computer Vision Foundation / IEEE, (2020)

iDirector: An Intelligent Directing System for Live Broadcast. ACM Multimedia, page 4545-4547. ACM, (2020)