Author of the publication

Acoustic Scene Classification by Implicitly Identifying Distinct Sound Events.

, , , and . INTERSPEECH, page 3860-3864. ISCA, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge., , , , , , , , , and 2 other author(s). CoRR, (2021)Double Adversarial Network Based Monaural Speech Enhancement for Robust Speech Recognition., , and . INTERSPEECH, page 309-313. ISCA, (2020)Acoustic Scene Classification by Implicitly Identifying Distinct Sound Events., , , and . INTERSPEECH, page 3860-3864. ISCA, (2019)TOLD: a Novel Two-Stage Overlap-Aware Framework for Speaker Diarization., , and . ICASSP, page 1-5. IEEE, (2023)Investigation of Monaural Front-End Processing for Robust Speech Recognition Without Retraining or Joint-Training., , and . APSIPA, page 249-254. IEEE, (2019)CASA-ASR: Context-Aware Speaker-Attributed ASR., , , , , , , and . CoRR, (2023)Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios., , , and . CoRR, (2022)FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec., , , and . CoRR, (2023)An Embarrassingly Simple Approach for LLM with Strong ASR Capacity., , , , , , , , , and 1 other author(s). CoRR, (2024)M2Met: The Icassp 2022 Multi-Channel Multi-Party Meeting Transcription Challenge., , , , , , , , , and 2 other author(s). ICASSP, page 6167-6171. IEEE, (2022)