Author of the publication

Augmenting Images for ASR and TTS Through Single-Loop and Dual-Loop Multimodal Chain Framework.

INTERSPEECH, page 4901-4905. ISCA, (2020)


Other publications of authors with the same name

Listening while speaking: Speech chain by deep learning. ASRU, page 301-308. IEEE, (2017)

Speech Chain for Semi-Supervised Learning of Japanese-English Code-Switching ASR and TTS. SLT, page 182-189. IEEE, (2018)

End-to-End Feedback Loss in Speech Chain Framework via Straight-Through Estimator. CoRR, (2018)

Speech recognition features based on deep latent Gaussian models. MLSP, page 1-6. IEEE, (2017)

Combining depth image and skeleton data from Kinect for recognizing words in the sign system for Indonesian language (SIBI (Sistem Isyarat Bahasa Indonesia)). 2013 International Conference on Advanced Computer Science and Information Systems (ICACSIS), page 387-392. IEEE, (September 2013)

Transformer-Based Acoustic Modeling for Hybrid Speech Recognition. ICASSP, page 6874-6878. IEEE, (2020)

Speech-to-Speech Translation Between Untranscribed Unknown Languages. ASRU, page 593-600. IEEE, (2019)

Generative Pre-training for Speech with Flow Matching. CoRR, (2023)

Local Monotonic Attention Mechanism for End-to-End Speech And Language Processing. IJCNLP(1), page 431-440. Asian Federation of Natural Language Processing, (2017)

Learning ASR Pathways: A Sparse Multilingual ASR Model. ICASSP, page 1-5. IEEE, (2023)