Author of the publication

Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition.

K. Puvvada, N. Koluguri, K. Dhawan, J. Balam, and B. Ginsburg. ICASSP, page 12111-12115. IEEE, (2024)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some of the publications already assigned to that person.

Rachel Ginsburg

Sigmar Ginsburg

Sarah Ginsburg

Bernhard Ginsburg

Samuel Ginsburg

Other publications of authors with the same name

Flexible Multichannel Speech Enhancement for Noise-Robust Frontend. A. Jukic, J. Balam, and B. Ginsburg. WASPAA, page 1-5. IEEE, (2023)

LibriSpeech-PC: Benchmark for Evaluation of Punctuation and Capitalization Capabilities of End-to-End ASR Models. A. Meister, M. Novikov, N. Karpov, E. Bakhturina, V. Lavrukhin, and B. Ginsburg. ASRU, page 1-7. IEEE, (2023)

Training Neural Speech Recognition Systems with Synthetic Speech Augmentation. J. Li, R. Gadde, B. Ginsburg, and V. Lavrukhin. CoRR, (2018)

Training Deep AutoEncoders for Collaborative Filtering. O. Kuchaiev, and B. Ginsburg. CoRR, (2017)

The ForSpec Temporal Logic: A New Temporal Property-Specification Language. R. Armoni, L. Fix, A. Flaisher, R. Gerth, B. Ginsburg, T. Kanza, A. Landver, S. Mador-Haim, E. Singerman, A. Tiemeyer and 2 other author(s). TACAS, volume 2280 of Lecture Notes in Computer Science, page 296-211. Springer, (2002)

Cross-Language Transfer Learning and Domain Adaptation for End-to-End Automatic Speech Recognition. J. Luo, J. Wang, N. Cheng, E. Xiao, J. Xiao, G. Kucsko, P. O'Neill, J. Balam, S. Deng, A. Flores and 5 other author(s). ICME, page 1-6. IEEE, (2021)

TalkNet: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis. S. Beliaev, and B. Ginsburg. Interspeech, page 3760-3764. ISCA, (2021)

CTC Variations Through New WFST Topologies. A. Laptev, S. Majumdar, and B. Ginsburg. INTERSPEECH, page 1041-1045. ISCA, (2022)

Label-Looping: Highly Efficient Decoding for Transducers. V. Bataev, H. Xu, D. Galvez, V. Lavrukhin, and B. Ginsburg. CoRR, (2024)

SALM: Speech-Augmented Language Model with in-Context Learning for Speech Recognition and Translation. Z. Chen, H. Huang, A. Andrusenko, O. Hrinchuk, K. Puvvada, J. Li, S. Ghosh, J. Balam, and B. Ginsburg. ICASSP, page 13521-13525. IEEE, (2024)

Disambiguation of "Ginsburg, Boris"
