Author of the publication

TVT: Two-View Transformer Network for Video Captioning.

, , , and . ACML, volume 95 of Proceedings of Machine Learning Research, page 847-862. PMLR, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Learning with limited and noisy tagging., , , and . ACM Multimedia, page 957-966. ACM, (2013)A Survey of Multi-View Representation Learning., , and . IEEE Trans. Knowl. Data Eng., 31 (10): 1863-1883 (2019)TVT: Two-View Transformer Network for Video Captioning., , , and . ACML, volume 95 of Proceedings of Machine Learning Research, page 847-862. PMLR, (2018)Affine Deformation Model Based Intra Block Copy for Intra Frame Coding., , , , , , and . ISCAS, page 1-5. IEEE, (2020)Training a Lightweight ViT Network for Image Retrieval., , , and . PRICAI (3), volume 13631 of Lecture Notes in Computer Science, page 240-250. Springer, (2022)Low-Rank HOCA: Efficient High-Order Cross-Modal Attention for Video Captioning., , , and . EMNLP/IJCNLP (1), page 2001-2011. Association for Computational Linguistics, (2019)Mask-free Iterative Refinement Network for weakly-supervised Few-shot Semantic Segmentation., , , , and . Neurocomputing, (2025)Adapting Pre-trained Generative Model to Medical Image for Data Augmentation., , , , , and . MICCAI (5), volume 15005 of Lecture Notes in Computer Science, page 79-89. Springer, (2024)Relative Attribute Learning with Deep Attentive Cross-image Representation., , and . ACML, volume 95 of Proceedings of Machine Learning Research, page 879-892. PMLR, (2018)Video-Grounded Dialogues with Joint Video and Image Training., , and . ICIP, page 3903-3907. IEEE, (2022)