Article

UniViLM: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation.

H. Luo, L. Ji, B. Shi, H. Huang, N. Duan, T. Li, X. Chen, and M. Zhou.
CoRR, abs/2002.06353 (2020)

Metadata

BibTeX key: journals/corr/abs-2002-06353
entry type: article
year: 2020
journal: CoRR
volume: abs/2002.06353
ee: https://arxiv.org/abs/2002.06353
url: http://dblp.uni-trier.de/db/journals/corr/corr2002.html#abs-2002-06353

Tags

dblp
