Author of the publication

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension.

, , , , , , , , , , and . ACL (1), page 1979-1998. Association for Computational Linguistics, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models., , , , , , and . ACL (Findings), page 11655-11671. Association for Computational Linguistics, (2023)A Robotic Communication Middleware Combining High Performance and High Reliability., , , , and . SBAC-PAD, page 217-224. IEEE, (2020)Zero-shot Explainable Mental Health Analysis on Social Media by Incorporating Mental Scales., , , , , and . WWW (Companion Volume), page 959-962. ACM, (2024)Precise Apple Detection and Localization in Orchards using YOLOv5 for Robotic Harvesting Systems., , and . CoRR, (2024)FastDiff 2: Revisiting and Incorporating GANs and Diffusion Models in High-Fidelity Speech Synthesis., , , , , and . ACL (Findings), page 6994-7009. Association for Computational Linguistics, (2023)Self-Supervised Spoofing Audio Detection Scheme., , , , and . INTERSPEECH, page 4223-4227. ISCA, (2020)Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis., , , , , , , , , and 3 other author(s). ICLR, OpenReview.net, (2024)TextrolSpeech: A Text Style Control Speech Corpus with Codec Language Text-to-Speech Models., , , , , , , and . ICASSP, page 10301-10305. IEEE, (2024)MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis., , , , , , , , and . CoRR, (2024)DiffEditor: Enhancing Speech Editing with Semantic Enrichment and Acoustic Consistency., , , , , , and . CoRR, (2024)