Author of the publication

Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions.

, , , , , , , and . CVPR, page 5026-5035. IEEE, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training., , , , , , and . NeurIPS, page 4514-4528. (2021)Sed-Net: Detecting Multi-Type Edits Of Images., , , , and . ICME, page 1-6. IEEE, (2020)Tri-axial Motion Sensing with Mechanomagnetic Effect for Human-Machine Interface., , , and . ICIRA (4), volume 13458 of Lecture Notes in Computer Science, page 29-38. Springer, (2022)Learning Fine-Grained Motion Embedding for Landscape Animation., , , , , and . ACM Multimedia, page 291-299. ACM, (2021)Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning., , , , , and . NeurIPS, (2022)Unifying Multimodal Transformer for Bi-directional Image and Text Generation., , , and . ACM Multimedia, page 1138-1147. ACM, (2021)CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Alignment., , , , , , and . ICLR, OpenReview.net, (2023)CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment., , , , , , and . CoRR, (2022)Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training., , , , , , and . CoRR, (2021)Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions., , , , , , , and . CVPR, page 5026-5035. IEEE, (2022)