Author of the publication

Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition.

, , , , and . ICASSP, page 12111-12115. IEEE, (2024)


Other publications of authors with the same name

Flexible Multichannel Speech Enhancement for Noise-Robust Frontend., , and . WASPAA, page 1-5. IEEE, (2023)

LibriSpeech-PC: Benchmark for Evaluation of Punctuation and Capitalization Capabilities of End-to-End ASR Models., , , , , and . ASRU, page 1-7. IEEE, (2023)

Training Neural Speech Recognition Systems with Synthetic Speech Augmentation., , , and . CoRR, (2018)

Training Deep AutoEncoders for Collaborative Filtering., and . CoRR, (2017)

The ForSpec Temporal Logic: A New Temporal Property-Specification Language., , , , , , , , , and 2 other author(s). TACAS, volume 2280 of Lecture Notes in Computer Science, page 296-311. Springer, (2002)

Cross-Language Transfer Learning and Domain Adaptation for End-to-End Automatic Speech Recognition., , , , , , , , , and 5 other author(s). ICME, page 1-6. IEEE, (2021)

TalkNet: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis., and . Interspeech, page 3760-3764. ISCA, (2021)

CTC Variations Through New WFST Topologies., , and . INTERSPEECH, page 1041-1045. ISCA, (2022)

Label-Looping: Highly Efficient Decoding for Transducers., , , , and . CoRR, (2024)

SALM: Speech-Augmented Language Model with in-Context Learning for Speech Recognition and Translation., , , , , , , , and . ICASSP, page 13521-13525. IEEE, (2024)