Author of the publication

DPT-FSNet: Dual-Path Transformer Based Full-Band and Sub-Band Fusion Network for Speech Enhancement.

, , and . ICASSP, page 6857-6861. IEEE, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Power Pooling: An Adaptive Pooling Function for Weakly Labelled Sound Event Detection., , , and . IJCNN, page 1-7. IEEE, (2021)High Fidelity Speech Enhancement with Band-split RNN., , , , and . INTERSPEECH, page 2483-2487. ISCA, (2023)Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output., , , and . INTERSPEECH, page 866-870. ISCA, (2022)Audio Scene Classification with Discriminatively-Trained Segment-Level Features., , and . ICME Workshops, page 354-359. IEEE, (2019)SECap: Speech Emotion Captioning with Large Language Model., , , , , , , , and . AAAI, page 19323-19331. AAAI Press, (2024)Gull: A Generative Multifunctional Audio Codec., , , , and . CoRR, (2024)Improved Guided Source Separation Integrated with a Strong Back-End for the CHiME-6 Dinner Party Scenario., , , and . INTERSPEECH, page 334-338. ISCA, (2020)Speaker-Invariant Feature-Mapping for Distant Speech Recognition via Adversarial Teacher-Student Learning., , , , and . INTERSPEECH, page 431-435. ISCA, (2019)Deep Convolutional Neural Network with Scalogram for Audio Scene Modeling., , , , , and . INTERSPEECH, page 3304-3308. ISCA, (2018)Bayes Risk Transducer: Transducer with Controllable Alignment Prediction., , , , , , and . INTERSPEECH, page 4968-4972. ISCA, (2023)