Author of the publication

Waffling around for Performance: Visual Classification with Random Words and Broad Concepts.

, , , , , and . ICCV, page 15700-15711. IEEE, (2023)

Other publications of authors with the same name

Waffling around for Performance: Visual Classification with Random Words and Broad Concepts., , , , , and . ICCV, page 15700-15711. IEEE, (2023)

A SOUND APPROACH: Using Large Language Models to generate audio descriptions for egocentric text-audio retrieval., , , , and . CoRR, (2024)

Audio Retrieval with Natural Language Queries: A Benchmark Study., , , , and . CoRR, (2021)

Sight to Sound: An End-to-End Approach for Visual Piano Transcription., , , and . ICASSP, page 1838-1842. IEEE, (2020)

Zero-Shot Translation of Attention Patterns in VQA Models to Natural Language., , , and . DAGM, volume 14264 of Lecture Notes in Computer Science, page 378-393. Springer, (2023)

X2Face: A network for controlling face generation by using images, audio, and pose codes., , and . CoRR, (2018)

Audio Retrieval With Natural Language Queries: A Benchmark Study., , , , and . IEEE Trans. Multim., (2023)

Self-Supervised Learning of Class Embeddings from Video., , and . ICCV Workshops, page 3019-3027. IEEE, (2019)

Audio Retrieval with Natural Language Queries., , , , and . Interspeech, page 2411-2415. ISCA, (2021)

X2Face: A Network for Controlling Face Generation Using Images, Audio, and Pose Codes., , and . ECCV (13), volume 11217 of Lecture Notes in Computer Science, page 690-706. Springer, (2018)