Author of the publication


Diversity-Controllable and Accurate Audio Captioning Based on Neural Condition.

X. Xu, M. Wu, and K. Yu. ICASSP, page 971-975. IEEE, (2022)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some publications already assigned to that person.

Xu Xu

Xianxiang Xu

Imaging findings of cerebral schistosomiasis in China. X. Xu. Uni Hamburg, (2010)

Yan Xu

Untersuchungen zur Verleimung von Holz und Holzspanplatten mit UF-Leimharzen und PMDI. Y. Xu. TU Braunschweig, (2009)

Yan Xu

Other publications of authors with the same name

Navigating Audio-Visual Event Detection Across Mismatched Modalities. G. Li, X. Xu, M. Wu, and K. Yu. ICASSP, page 1975-1979. IEEE, (2022)

PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation. Z. Xie, X. Xu, Z. Wu, and M. Wu. CoRR, (2024)

Investigating Passive Filter Pruning for Efficient CNN-Transformer Audio Captioning. X. Xu, A. Singh, M. Wu, W. Wang, and M. Plumbley. MLSP, page 1-6. IEEE, (2024)

Enhancing Audio Generation Diversity with Visual Information. Z. Xie, B. Li, X. Xu, M. Wu, and K. Yu. ICASSP, page 866-870. IEEE, (2024)

Towards Weakly Supervised Text-to-Audio Grounding. X. Xu, Z. Ma, M. Wu, and K. Yu. IEEE Trans. Multim., (2024)

DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning. X. Li, W. Chen, Z. Ma, X. Xu, Y. Liang, Z. Zheng, Q. Kong, and X. Chen. ICASSP, page 1-5. IEEE, (2025)

PicoAudio: Enabling Precise Temporal Controllability in Text-to-Audio Generation. Z. Xie, X. Xu, Z. Wu, and M. Wu. ICASSP, page 1-5. IEEE, (2025)

Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning. X. Xu, H. Dinkel, M. Wu, Z. Xie, and K. Yu. ICASSP, page 905-909. IEEE, (2021)

Diversity-Controllable and Accurate Audio Captioning Based on Neural Condition. X. Xu, M. Wu, and K. Yu. ICASSP, page 971-975. IEEE, (2022)

A Lightweight Framework for Online Voice Activity Detection in the Wild. X. Xu, H. Dinkel, M. Wu, and K. Yu. Interspeech, page 371-375. ISCA, (2021)

BibSonomy

Disambiguation of "Xu, Xuenan"
