Author of the publication

A Study on Joint Modeling and Data Augmentation of Multi-Modalities for Audio-Visual Scene Classification.

ISCSLP, page 453-457. IEEE, (2022)

Other publications of authors with the same name

Convolutional Neural Network Based Radio Tomographic Imaging. CISS, page 1-6. IEEE, (2020)
Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification. CoRR, (2023)
Generative error correction for code-switching speech recognition using large language models. CoRR, (2023)
Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue. CoRR, (2023)
Inference and Denoise: Causal Inference-based Neural Speech Enhancement. CoRR, (2022)
Exploiting Low-Rank Tensor-Train Deep Neural Networks Based on Riemannian Gradient Descent With Illustrations of Speech Processing. CoRR, (2022)
Wavelet Channel Attention Module With A Fusion Network For Single Image Deraining. ICIP, page 883-887. IEEE, (2020)
A Study on Joint Modeling and Data Augmentation of Multi-Modalities for Audio-Visual Scene Classification. ISCSLP, page 453-457. IEEE, (2022)
Low-Resource Music Genre Classification with Cross-Modal Neural Model Reprogramming. ICASSP, page 1-5. IEEE, (2023)
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition. ICASSP, page 1-5. IEEE, (2023)