Author of the publication

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition.

, , , , , , and . CVPR, page 13577-13587. IEEE, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Camera-based vehicle velocity estimation from monocular video., , and . CoRR, (2018)Temporal Residual Networks for Dynamic Scene Recognition., , and . CVPR, page 7435-7444. IEEE Computer Society, (2017)Grounded Human-Object Interaction Hotspots From Video., , and . ICCV, page 8687-8696. IEEE, (2019)Diffusion Models as Masked Autoencoders., , , , , , , , , and . ICCV, page 16238-16248. IEEE, (2023)Reversible Vision Transformers., , , , , , and . CVPR, page 10820-10830. IEEE, (2022)Long-Term Feature Banks for Detailed Video Understanding., , , , , and . CVPR, page 284-293. Computer Vision Foundation / IEEE, (2019)PyTorchVideo: A Deep Learning Library for Video Understanding., , , , , , , , , and 6 other author(s). ACM Multimedia, page 3783-3786. ACM, (2021)A Multigrid Method for Efficiently Training Video Models., , , , and . CoRR, (2019)Multiview Pseudo-Labeling for Semi-supervised Learning from Video., , , and . ICCV, page 7189-7199. IEEE, (2021)A Multigrid Method for Efficiently Training Video Models., , , , and . CVPR, page 150-159. Computer Vision Foundation / IEEE, (2020)