Author of the publication

Diversity-Controllable and Accurate Audio Captioning Based on Neural Condition.

ICASSP, pages 971-975. IEEE, (2022)

Other publications of authors with the same name

Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning. ICASSP, pages 905-909. IEEE, (2021)
A Lightweight Framework for Online Voice Activity Detection in the Wild. Interspeech, pages 371-375. ISCA, (2021)
DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation. CoRR, (2024)
Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models. CoRR, (2024)
T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining. MLSP, pages 1-6. IEEE, (2024)
Category-Adapted Sound Event Enhancement with Weakly Labeled Data. ICASSP, pages 851-855. IEEE, (2022)
Text-to-Audio Grounding: Building Correspondence Between Captions and Sound Events. ICASSP, pages 606-610. IEEE, (2021)
BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data. ACM Multimedia, pages 2756-2764. ACM, (2023)
AudioTime: A Temporally-aligned Audio-text Benchmark Dataset. CoRR, (2024)