Author of the publication

Cross-modal Contrastive Distillation for Instructional Activity Anticipation.

, , , , , , and . ICPR, page 5002-5009. IEEE, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Attentive Relational Networks for Mapping Images to Scene Graphs., , , , and . CoRR, (2018)Grounding-Tracking-Integration., , , and . CoRR, (2019)MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning., , , , , , , and . CoRR, (2023)SAT: 2D Semantics Assisted Training for 3D Visual Grounding., , , and . ICCV, page 1836-1846. IEEE, (2021)SGFormer: Semantic Graph Transformer for Point Cloud-Based 3D Scene Graph Generation., , , , and . AAAI, page 4035-4043. AAAI Press, (2024)ReCo: Region-Controlled Text-to-Image Generation., , , , , , , , , and 1 other author(s). CVPR, page 14246-14255. IEEE, (2023)Improving One-Stage Visual Grounding by Recursive Sub-query Construction., , , and . ECCV (14), volume 12359 of Lecture Notes in Computer Science, page 387-404. Springer, (2020)Action Recognition with Visual Attention on Skeleton Images., , , and . ICPR, page 3309-3314. IEEE Computer Society, (2018)Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition., , , , , , , , and . CoRR, (2024)MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos., , , , , , , , , and 4 other author(s). CoRR, (2024)