From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input., , , , , и . ECCV (6), том 11210 из Lecture Notes in Computer Science, стр. 659-677. Springer, (2018)Adversarial Input Ablation for Audio-Visual Learning., и . ICASSP, стр. 7742-7746. IEEE, (2022)Fast-Slow Transformer for Visually Grounding Speech., и . ICASSP, стр. 7727-7731. IEEE, (2022)Unsupervised Fine-Tuning Data Selection for ASR Using Self-Supervised Speech Models., и . ICASSP, стр. 1-5. IEEE, (2023)Learning Words by Drawing Images., , , , , и . CVPR, стр. 2029-2038. Computer Vision Foundation / IEEE, (2019)A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition., , , , , , , , , и 17 other автор(ы). ICASSP, стр. 8111-8115. IEEE, (2013)Word Discovery in Visually Grounded, Self-Supervised Speech Models., и . INTERSPEECH, стр. 2823-2827. ISCA, (2022)MAE-AST: Masked Autoencoding Audio Spectrogram Transformer., , и . INTERSPEECH, стр. 2438-2442. ISCA, (2022)SpeechCLIP+: Self-Supervised Multi-Task Representation Learning for Speech Via Clip and Speech-Image Data., , , , , , , и . ICASSP Workshops, стр. 465-469. IEEE, (2024)Learning to Map Efficiently by Active Echolocation., , , и . IROS, стр. 1505-1510. (2023)