Author of the publication

GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval.

, , , , , and . ECCV (35), volume 13695 of Lecture Notes in Computer Science, page 709-725. Springer, (2022)

Other publications of authors with the same name

Env-QA: A Video Question Answering Benchmark for Comprehensive Understanding of Dynamic Environments., , , and . ICCV, page 1655-1665. IEEE, (2021)

GazeVQA: A Video Question Answering Dataset for Multiview Eye-Gaze Task-Oriented Collaborations., , , , , , , and . EMNLP, page 10462-10479. Association for Computational Linguistics, (2023)

VideoGUI: A Benchmark for GUI Automation from Instructional Videos., , , , , , , and . CoRR, (2024)

CVPR 2023 Text Guided Video Editing Competition., , , , , , , , , and 10 other author(s). CoRR, (2023)

Correlated warped Gaussian processes for gender-specific age estimation., , , , and . ICIP, page 133-137. IEEE, (2015)

Egocentric Video-Language Pretraining., , , , , , , , , and 6 other author(s). NeurIPS, (2022)

GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval., , , , , and . ECCV (35), volume 13695 of Lecture Notes in Computer Science, page 709-725. Springer, (2022)

MIST : Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering., , , , , and . CVPR, page 14773-14783. IEEE, (2023)

Egocentric Video-Language Pretraining @ Ego4D Challenge 2022., , , , , , , , , and 6 other author(s). CoRR, (2022)

AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant., , , , , , and . EMNLP (Findings), page 319-338. Association for Computational Linguistics, (2022)