From post

копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning.

X. Xu, B. Li, C. Wu, S. Tseng, A. Bhiwandiwalla, S. Rosenman, V. Lal, W. Che, и N. Duan. ACL (1), стр. 14507-14525. Association for Computational Linguistics, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

Jinglan Wu

Recovery of citric acid from fermentation broth using simulated moving bed technology: = Reinigung von Zitronensäure aus Fermentationslösung durch kontinuierliche ChromatographieJ. Wu. Uni Erlangen-Nürnberg, (2009)

Changzhu Wu

Yinghua Wu

Jinjin Wu

Haiyan Wu

Другие публикации лиц с тем же именем

EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation.W. You, W. Wu, Y. Liang, S. Mao, C. Wu, M. Cao, Y. Cai, Y. Guo, Y. Xia, F. Wei и 1 other автор(ы). CoRR, (2023)Trace Controlled Text to Image Generation.K. Yan, L. Ji, C. Wu, J. Bao, M. Zhou, N. Duan, и S. Ma. ECCV (36), том 13696 из Lecture Notes in Computer Science, стр. 59-75. Springer, (2022)KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation.Y. Liu, C. Wu, S. Tseng, V. Lal, X. He, и N. Duan. NAACL-HLT (Findings), стр. 1589-1600. Association for Computational Linguistics, (2022)ReCo: Region-Controlled Text-to-Image Generation.Z. Yang, J. Wang, Z. Gan, L. Li, K. Lin, C. Wu, N. Duan, Z. Liu, C. Liu, M. Zeng и 1 other автор(ы). CVPR, стр. 14246-14255. IEEE, (2023)NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis.J. Liang, C. Wu, X. Hu, Z. Gan, J. Wang, L. Wang, Z. Liu, Y. Fang, и N. Duan. NeurIPS, (2022)BridgeTower: Building Bridges between Encoders in Vision-Language Representation Learning.X. Xu, C. Wu, S. Rosenman, V. Lal, W. Che, и N. Duan. AAAI, стр. 10637-10647. AAAI Press, (2023)GEM: A General Evaluation Benchmark for Multimodal Tasks.L. Su, N. Duan, E. Cui, L. Ji, C. Wu, H. Luo, Y. Liu, M. Zhong, T. Bharti, и A. Sacheti. ACL/IJCNLP (Findings), том ACL/IJCNLP 2021 из Findings of ACL, стр. 2594-2603. Association for Computational Linguistics, (2021)ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning.X. Xu, B. Li, C. Wu, S. Tseng, A. Bhiwandiwalla, S. Rosenman, V. Lal, W. Che, и N. Duan. ACL (1), стр. 14507-14525. Association for Computational Linguistics, (2023)Sequential Visual Reasoning for Visual Question Answering.J. Liu, C. Wu, X. Wang, и X. Dong. CCIS, стр. 410-415. IEEE, (2018)LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language ModelsZ. Tang, C. Wu, J. Li, и N. Duan. (2023)cite arxiv:2309.09506.

BibSonomy

Disambiguation