Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some of the publications already assigned to that person.

 

Other publications of authors with the same name

Power Pooling: An Adaptive Pooling Function for Weakly Labelled Sound Event Detection. IJCNN, pages 1-7. IEEE, (2021)
Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output. INTERSPEECH, pages 866-870. ISCA, (2022)
High Fidelity Speech Enhancement with Band-split RNN. INTERSPEECH, pages 2483-2487. ISCA, (2023)
Speaker-Invariant Feature-Mapping for Distant Speech Recognition via Adversarial Teacher-Student Learning. INTERSPEECH, pages 431-435. ISCA, (2019)
Deep Convolutional Neural Network with Scalogram for Audio Scene Modeling. INTERSPEECH, pages 3304-3308. ISCA, (2018)
SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor. CoRR, (2024)
Audio Scene Classification with Discriminatively-Trained Segment-Level Features. ICME Workshops, pages 354-359. IEEE, (2019)
SECap: Speech Emotion Captioning with Large Language Model. AAAI, pages 19323-19331. AAAI Press, (2024)
UniSep: Universal Target Audio Separation with Language Models at Scale. CoRR, (March 2025)
AudioComposer: Towards Fine-grained Audio Generation with Natural Language Descriptions. CoRR, (2024)