Author of the publication

The NAIST ASR system for the 2015 Multi-Genre Broadcast challenge: On combination of deep learning systems using a rank-score function.

ASRU, page 654-659. IEEE, (2015)


Other publications of authors with the same name

A Robust Bimodal Speech Section Detection. VLSI Signal Processing, 36 (2-3): 81-90 (2004)
Reversible display: novel interaction techniques for digital contents. Critical Computing, page 113-116. ACM, (2005)
A Study Toward an Evaluation Method for Spoken Dialogue Systems Considering User Criteria. IWSDS, volume 6392 of Lecture Notes in Computer Science, page 176-181. Springer, (2010)
Sightseeing Guidance Systems Based on WFST-Based Dialogue Manager. IWSDS, volume 6392 of Lecture Notes in Computer Science, page 194-195. Springer, (2010)
Evaluation of Facial Direction Estimation from Cameras for Multi-modal Spoken Dialog System. IWSDS, volume 6392 of Lecture Notes in Computer Science, page 73-84. Springer, (2010)
Compressing Recurrent Neural Network with Tensor Train. CoRR, (2017)
Fusion of Audio-Visual Information for Integrated Speech Processing. AVBPA, volume 2091 of Lecture Notes in Computer Science, page 127-143. Springer, (2001)
Japanese Spontaneous Spoken Document Retrieval Using NMF-Based Topic Models. AIRS, volume 5839 of Lecture Notes in Computer Science, page 149-156. Springer, (2009)
Korean pronunciation variation modeling with probabilistic Bayesian networks. IUCS, page 52-57. IEEE, (2010)
Implementation of F0 transformation for statistical singing voice conversion based on direct waveform modification. ICASSP, page 5670-5674. IEEE, (2016)