Author of the publication

Panoramic Vision Transformer for Saliency Detection in 360$^$ Videos.

, , and . ECCV (35), volume 13695 of Lecture Notes in Computer Science, page 422-439. Springer, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Panoramic Vision Transformer for Saliency Detection in 360° Videos., , and . CoRR, (2022)A Mobile Robot Generating Video Summaries of Seniors' Indoor Activities., , , and . MobileHCI, page 54:1-54:6. ACM, (2019)Fusing Pre-Trained Language Models with Multimodal Prompts through Reinforcement Learning., , , , , , , , , and 1 other author(s). CVPR, page 10845-10856. IEEE, (2023)Multimodal Knowledge Alignment with Reinforcement Learning., , , , , , , , , and 1 other author(s). CoRR, (2022)Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal Distillation., , and . ICCV, page 7829-7838. IEEE, (2023)Character Grounding and Re-identification in Story of Videos and Text Descriptions., , , , and . ECCV (5), volume 12350 of Lecture Notes in Computer Science, page 543-559. Springer, (2020)Video Summarization through Human Detection on a Social Robot., , and . CoRR, (2019)Pano-AVQA: Grounded Audio-Visual Question Answering on 360° Videos., , , , and . ICCV, page 2011-2021. IEEE, (2021)Panoramic Vision Transformer for Saliency Detection in 360$^$ Videos., , and . ECCV (35), volume 13695 of Lecture Notes in Computer Science, page 422-439. Springer, (2022)Transitional Adaptation of Pretrained Models for Visual Storytelling., , , , and . CVPR, page 12658-12668. Computer Vision Foundation / IEEE, (2021)