Author of the publication

Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations.

, , , and . INTERSPEECH, page 501-505. ISCA, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Isolated Mandarin syllable recognition with limited training data specially considering the effect of tones., , and . IEEE Trans. Speech Audio Process., 5 (1): 75-80 (1997)Model-Based Unsupervised Spoken Term Detection with Spoken Queries., and . IEEE Trans. Speech Audio Process., 21 (7): 1330-1342 (2013)Improved semantic retrieval of spoken content by language models enhanced with acoustic similarity graph., , and . SLT, page 182-187. IEEE, (2012)Improved spoken term detection with graph-based re-ranking in feature space., , , , and . ICASSP, page 5644-5647. IEEE, (2011)An initial attempt to improve spoken term detection by learning optimal weights for different indexing features., , , and . ICASSP, page 5278-5281. IEEE, (2010)Integrating recognition and retrieval with user feedback: A new framework for spoken term detection., and . ICASSP, page 5290-5293. IEEE, (2010)Continuous hidden Markov models integrating transitional and instantaneous features for Mandarin syllable recognition., and . Comput. Speech Lang., 7 (3): 247-263 (1993)Enhancing sparse voice annotation for semantic retrieval of personal photos by continuous space word representations., , , , and . ICASSP, page 5341-5345. IEEE, (2015)An augmented chart data structure with efficient word lattice parsing scheme in speech recognition applications., , and . Speech Commun., 10 (2): 129-144 (1991)Finding Complex Features for Guest Language Fragment Recovery in Resource-Limited Code-Mixed Speech Recognition., , and . IEEE ACM Trans. Audio Speech Lang. Process., 23 (12): 2148-2161 (2015)