Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Fusing Pre-Trained Language Models with Multimodal Prompts through Reinforcement Learning., , , , , , , , , and 1 other author(s). CVPR, page 10845-10856. IEEE, (2023)A Mobile Robot Generating Video Summaries of Seniors' Indoor Activities., , , and . MobileHCI, page 54:1-54:6. ACM, (2019)Panoramic Vision Transformer for Saliency Detection in 360° Videos., , and . CoRR, (2022)Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal Distillation., , and . ICCV, page 7829-7838. IEEE, (2023)Multimodal Knowledge Alignment with Reinforcement Learning., , , , , , , , , and 1 other author(s). CoRR, (2022)Character Grounding and Re-identification in Story of Videos and Text Descriptions., , , , and . ECCV (5), volume 12350 of Lecture Notes in Computer Science, page 543-559. Springer, (2020)Video Summarization through Human Detection on a Social Robot., , and . CoRR, (2019)Panoramic Vision Transformer for Saliency Detection in 360$^$ Videos., , and . ECCV (35), volume 13695 of Lecture Notes in Computer Science, page 422-439. Springer, (2022)Pano-AVQA: Grounded Audio-Visual Question Answering on 360° Videos., , , , and . ICCV, page 2011-2021. IEEE, (2021)Transitional Adaptation of Pretrained Models for Visual Storytelling., , , , and . CVPR, page 12658-12668. Computer Vision Foundation / IEEE, (2021)