Author of the publication

Combining multiple sources of knowledge in deep CNNs for action recognition.

, , , and . WACV, page 1-8. IEEE Computer Society, (2016)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Dance Dance Generation: Motion Transfer for Internet Videos., , , , and . ICCV Workshops, page 1208-1216. IEEE, (2019)ReferItGame: Referring to Objects in Photographs of Natural Scenes., , , and . EMNLP, page 787-798. ACL, (2014)VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation., , , , , , , , , and 5 other author(s). NeurIPS Datasets and Benchmarks, (2021)Revealing Single Frame Bias for Video-and-Language Learning., , and . ACL (1), page 487-507. Association for Computational Linguistics, (2023)CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval., , , , , , and . KDD, page 4433-4442. ACM, (2022)Visual to Sound: Generating Natural Sound for Videos in the Wild., , , , and . CVPR, page 3550-3558. Computer Vision Foundation / IEEE Computer Society, (2018)Parsing clothing in fashion photographs., , , and . CVPR, page 3570-3577. IEEE Computer Society, (2012)Who are you with and where are you going?, , , and . CVPR, page 1345-1352. IEEE Computer Society, (2011)Automatic Attribute Discovery and Characterization from Noisy Web Data., , and . ECCV (1), volume 6311 of Lecture Notes in Computer Science, page 663-676. Springer, (2010)TVQA+: Spatio-Temporal Grounding for Video Question Answering., , , and . CoRR, (2019)