Author of the publication

Segment-Level Metric Learning for Few-Shot Bioacoustic Event Detection.

, , , , , and . DCASE, Tampere University, (2022)


Other publications of authors with the same name

FlashSpeech: Efficient Zero-Shot Speech Synthesis., , , , , , , , , and 3 other author(s). CoRR, (2024)

Neural Vocoder is All You Need for Speech Super-resolution., , , , , and . INTERSPEECH, page 4227-4231. ISCA, (2022)

Separate Anything You Describe., , , , , , , , , and . CoRR, (2023)

Fish Tracking, Counting, and Behaviour Analysis in Digital Aquaculture: A Comprehensive Review., , , , , and . CoRR, (2024)

AudioLDM 2: Learning Holistic Audio Generation With Self-Supervised Pretraining., , , , , , , , , and . IEEE ACM Trans. Audio Speech Lang. Process., (2024)

Audiosr: Versatile Audio Super-Resolution at Scale., , , , and . ICASSP, page 1076-1080. IEEE, (2024)

Leveraging Pre-trained BERT for Audio Captioning., , , , , , , , and . EUSIPCO, page 1145-1149. IEEE, (2022)

T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining., , , , , , , , and . CoRR, (2024)

NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality., , , , , , , , , and 4 other author(s). CoRR, (2022)

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research., , , , , , , , and . IEEE ACM Trans. Audio Speech Lang. Process., (2024)