Author of the publication

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks.

X. Li, X. Yin, C. Li, P. Zhang, X. Hu, L. Zhang, L. Wang, H. Hu, L. Dong, F. Wei, Y. Choi, and J. Gao. ECCV (30), volume 12375 of Lecture Notes in Computer Science, page 121-137. Springer, (2020)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to the person.

Jun Hu

Zaijun Hu

Bing Hu

Ruguo Hu

Hu Wang

Other publications of authors with the same name

Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging. J. Chaves, S. Huang, Y. Xu, H. Xu, N. Usuyama, S. Zhang, F. Wang, Y. Xie, M. Khademi, Z. Yang and 16 other author(s). CoRR, (2024)

An Universal Image Attractiveness Ranking Framework. N. Ma, A. Volkov, A. Livshits, P. Pietrusinski, H. Hu, and M. Bolin. WACV, page 657-665. IEEE, (2019)

Stacked Cross Attention for Image-Text Matching. K. Lee, X. Chen, G. Hua, H. Hu, and X. He. ECCV (4), volume 11208 of Lecture Notes in Computer Science, page 212-228. Springer, (2018)

Image Scene Graph Generation (SGG) Benchmark. X. Han, J. Yang, H. Hu, L. Zhang, J. Gao, and P. Zhang. CoRR, (2021)

Electronic Structure Models: Solution Theory, Linear Scaling Methods, and Stability Analysis. H. Hu. University of California, San Diego, USA, (2014)

MMPTRACK: Large-scale Densely Annotated Multi-camera Multiple People Tracking Benchmark. X. Han, Q. You, C. Wang, Z. Zhang, P. Chu, H. Hu, J. Wang, and Z. Liu. WACV, page 4849-4858. IEEE, (2023)

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks. X. Li, X. Yin, C. Li, P. Zhang, X. Hu, L. Zhang, L. Wang, H. Hu, L. Dong, F. Wei and 2 other author(s). ECCV (30), volume 12375 of Lecture Notes in Computer Science, page 121-137. Springer, (2020)

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks. B. Xiao, H. Wu, W. Xu, X. Dai, H. Hu, Y. Lu, M. Zeng, C. Liu, and L. Yuan. CVPR, page 4818-4829. IEEE, (2024)

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks. B. Xiao, H. Wu, W. Xu, X. Dai, H. Hu, Y. Lu, M. Zeng, C. Liu, and L. Yuan. CoRR, (2023)

ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models. C. Li, H. Liu, L. Li, P. Zhang, J. Aneja, J. Yang, P. Jin, H. Hu, Z. Liu, Y. Lee and 1 other author(s). NeurIPS, (2022)

BibSonomy

Disambiguation of "Hu, Houdong"
