Author of the publication

Listening While Speaking and Visualizing: Improving ASR Through Multimodal Chain.

ASRU, page 471-478. IEEE, (2019)

Other publications of authors with the same name

Tensor Decomposition for Compressing Recurrent Neural Network. IJCNN, page 1-8. IEEE, (2018)

Interactive Image Manipulation with Natural Language Instruction Commands. CoRR, (2018)

Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition. CoRR, (2020)

Phoneme-level speaking rate variation on waveform generation using GAN-TTS. O-COCOSDA, page 1-7. IEEE, (2019)

Instance-Level Heterogeneous Domain Adaptation for Limited-Labeled Sketch-to-Photo Retrieval. IEEE Trans. Multim., (2021)

Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas. LREC, European Language Resources Association (ELRA), (2018)

Multi-Scale Alignment and Contextual History for Attention Mechanism in Sequence-to-Sequence Model. SLT, page 648-655. IEEE, (2018)

Multi-Modal Multi-Task Deep Learning For Speaker And Emotion Recognition Of TV-Series Data. O-COCOSDA, page 37-42. IEEE, (2018)

An Enhanced Electrolarynx with Automatic Fundamental Frequency Control based on Statistical Prediction. ASSETS, page 435-436. ACM, (2015)

Syntax-based Simultaneous Translation through Prediction of Unseen Syntactic Constituents. ACL (1), page 198-207. The Association for Computer Linguistics, (2015)