Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Navigating Audio-Visual Event Detection Across Mismatched Modalities., , , and . ICASSP, page 1975-1979. IEEE, (2022)PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation., , , and . CoRR, (2024)Investigating Passive Filter Pruning for Efficient CNN-Transformer Audio Captioning., , , , and . MLSP, page 1-6. IEEE, (2024)Enhancing Audio Generation Diversity with Visual Information., , , , and . ICASSP, page 866-870. IEEE, (2024)Towards Weakly Supervised Text-to-Audio Grounding., , , and . IEEE Trans. Multim., (2024)DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning., , , , , , , and . ICASSP, page 1-5. IEEE, (2025)PicoAudio: Enabling Precise Temporal Controllability in Text-to-Audio Generation., , , and . ICASSP, page 1-5. IEEE, (2025)Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning., , , , and . ICASSP, page 905-909. IEEE, (2021)Diversity-Controllable and Accurate Audio Captioning Based on Neural Condition., , and . ICASSP, page 971-975. IEEE, (2022)A Lightweight Framework for Online Voice Activity Detection in the Wild., , , and . Interspeech, page 371-375. ISCA, (2021)