Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training.

C. Jiang, W. Ye, H. Xu, Q. Ye, M. Yan, J. Zhang, and S. Zhang. AAAI, page 2489-2497. AAAI Press, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Qinghao Zheng

Fei Ye

Ye Chen

Ying Ye

Ye Hu

Other publications of authors with the same name

Exploring Global Diversity and Local Context for Video Summarization.Y. Pan, O. Huang, Q. Ye, Z. Li, W. Wang, G. Li, and Y. Chen. IEEE Access, (2022)mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration.Q. Ye, H. Xu, J. Ye, M. Yan, A. Hu, H. Liu, Q. Qian, J. Zhang, F. Huang, and J. Zhou. CoRR, (2023)Temporal Cue Guided Video Highlight Detection with Low-Rank Audio-Visual Fusion.Q. Ye, X. Shen, Y. Gao, Z. Wang, Q. Bi, P. Li, and G. Yang. ICCV, page 7930-7939. IEEE, (2021)COPA : Efficient Vision-Language Pre-training through Collaborative Object- and Patch-Text Alignment.C. Jiang, H. Xu, W. Ye, Q. Ye, C. Li, M. Yan, B. Bi, S. Zhang, F. Huang, and J. Zhang. ACM Multimedia, page 4480-4491. ACM, (2023)UniQRNet: Unifying Referring Expression Grounding and Segmentation with QRNet.J. Ye, J. Tian, M. Yan, H. Xu, Q. Ye, Y. Shi, X. Yang, X. Wang, J. Zhang, L. He and 1 other author(s). ACM Trans. Multim. Comput. Commun. Appl., 20 (8): 246:1-246:28 (August 2024)TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training.C. Jiang, W. Ye, H. Xu, Q. Ye, M. Yan, J. Zhang, and S. Zhang. AAAI, page 2489-2497. AAAI Press, (2024)Learning Trajectory-Word Alignments for Video-Language Tasks.X. Yang, Z. Li, H. Xu, H. Zhang, Q. Ye, C. Li, M. Yan, Y. Zhang, F. Huang, and S. Huang. ICCV, page 2504-2514. IEEE, (2023)Transforming Visual Scene Graphs to Image Captions.X. Yang, J. Peng, Z. Wang, H. Xu, Q. Ye, C. Li, S. Huang, F. Huang, Z. Li, and Y. Zhang. ACL (1), page 12427-12440. Association for Computational Linguistics, (2023)Evaluation and Analysis of Hallucination in Large Vision-Language Models.J. Wang, Y. Zhou, G. Xu, P. Shi, C. Zhao, H. Xu, Q. Ye, M. Yan, J. Zhang, J. Zhu and 2 other author(s). CoRR, (2023)mPLUG-Octopus: The Versatile Assistant Empowered by A Modularized End-to-End Multimodal LLM.Q. Ye, H. Xu, M. Yan, C. Zhao, J. Wang, X. Yang, J. Zhang, F. Huang, J. Sang, and C. Xu. ACM Multimedia, page 9365-9367. ACM, (2023)

BibSonomy

Disambiguation of "Ye, Qinghao"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training.

Please choose a person to relate this publication to

Qinghao Zheng

Fei Ye

Ye Chen

Ying Ye

Ye Hu

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Ye, Qinghao"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training.

Please choose a person to relate this publication to

Qinghao Zheng

Fei Ye

Ye Chen

Ying Ye

Ye Hu

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training.