Author of the publication

AudioCLIP: Extending CLIP to Image, Text and Audio

, , , and . (2021)cite arxiv:2106.13043Comment: submitted to GCPR 2021.

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Location-Specific Embedding Learning for the Semantic Segmentation of Building Footprints on a Global Scale., , , and . IGARSS, page 3951-3954. IEEE, (2019)Audioclip: Extending Clip to Image, Text and Audio., , , and . ICASSP, page 976-980. IEEE, (2022)Cross-Domain Transformation for Outlier Detection on Tabular Datasets., , , , , and . IJCNN, page 1-8. IEEE, (2023)DT2I: Dense Text-to-Image Generation from Region Descriptions., , , and . ICANN (2), volume 13530 of Lecture Notes in Computer Science, page 395-406. Springer, (2022)Sequential Spatial Transformer Networks for Salient Object Classification., , , , , and . ICPRAM, page 328-335. SCITEPRESS, (2023)AudioCLIP: Extending CLIP to Image, Text and Audio., , , and . CoRR, (2021)RDF Spreadsheet Editor: Get (G)rid of Your RDF Data Entry Problems., , , , and . ISWC (Posters, Demos & Industry Tracks), volume 1963 of CEUR Workshop Proceedings, CEUR-WS.org, (2017)Combining Transformer Generators with Convolutional Discriminators., , , , , , and . KI, volume 12873 of Lecture Notes in Computer Science, page 67-79. Springer, (2021)Fusion Strategies for Learning User Embeddings with Neural Networks., , , , and . IJCNN, page 1-8. IEEE, (2019)Multi-Scale Machine Learning for the Classification of Building Property Values., , , , and . IGARSS, page 4873-4876. IEEE, (2019)