From post

Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning.

, , , , и . ECCV (25), том 13685 из Lecture Notes in Computer Science, стр. 328-345. Springer, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Semi-supervised Hashing with Semantic Confidence for Large Scale Visual Search., , , , и . SIGIR, стр. 53-62. ACM, (2015)iDirector: An Intelligent Directing System for Live Broadcast., , , , , , и . ACM Multimedia, стр. 4545-4547. ACM, (2020)Hierarchy Parsing for Image Captioning., , , и . ICCV, стр. 2621-2629. IEEE, (2019)Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects., , , и . CVPR, стр. 5263-5271. IEEE Computer Society, (2017)Video Captioning with Transferred Semantic Attributes., , , и . CVPR, стр. 984-992. IEEE Computer Society, (2017)Semi-supervised Domain Adaptation with Subspace Learning for visual recognition., , , , и . CVPR, стр. 2142-2150. IEEE Computer Society, (2015)VireoJD-MM @ TRECVid 2019: Activities in Extended Video (ActEV)., , , и . TRECVID, National Institute of Standards and Technology (NIST), (2019)Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training., , , , , и . ACM Multimedia, стр. 7070-7074. ACM, (2022)Contextual and selective attention networks for image captioning., , , , , и . Sci. China Inf. Sci., (2022)Out-of-Distribution Detection via Conditional Kernel Independence Model., , , , , , и . NeurIPS, (2022)