Author of the publication

eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer.

, , , , , , , , , and . INTERSPEECH, page 3387-3391. ISCA, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech., , , , , , and . ICASSP, page 6573-6577. IEEE, (2021)Controllable Emphasis with zero data for text-to-speech., , , , , , , , , and 4 other author(s). SSW, page 113-119. ISCA, (2023)Enhanced Disparity Computation for ADAS Applications., , and . GI Jahrestagung (2), volume P-110 of LNI, page 135-139. GI, (2007)A Learned Conditional Prior for the VAE Acoustic Space of a TTS System., , , , , , , and . Interspeech, page 3620-3624. ISCA, (2021)Stereo vision based pedestrian detection using B-spline modeling., , , and . ICVES, page 63-68. IEEE, (2008)A Geometric Approach to Obtain a Bird's Eye View from an Image., and . CoRR, (2019)CPW-Fed Bow-Tie Antenna for Ambient RF Energy Harvesting Applications., , , , , , , and . NILES, page 416-419. IEEE, (2023)Camp: A Two-Stage Approach to Modelling Prosody in Context., , , , , , , , and . ICASSP, page 6578-6582. IEEE, (2021)Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech., , , , , , and . CoRR, (2020)VCAN-Controller Area Network Based Human Vital Sign Data Transmission Protocol., , , and . CSEE (1), volume 214 of Communications in Computer and Information Science, page 290-296. Springer, (2011)