Article

VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning.

Q. Zhu, L. Zhou, Z. Zhang, S. Liu, B. Jiao, J. Zhang, L. Dai, D. Jiang, J. Li, and F. Wei.
CoRR abs/2211.11275 (2022)

Metadata

BibTeX key: journals/corr/abs-2211-11275
entry type: article
year: 2022
journal: CoRR
volume: abs/2211.11275
ee: https://doi.org/10.48550/arXiv.2211.11275
url: http://dblp.uni-trier.de/db/journals/corr/corr2211.html#abs-2211-11275

Tags

dblp
