From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Scene as Occupancy., , , , , , , , , и 1 other автор(ы). ICCV, стр. 8372-8381. IEEE, (2023)OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text., , , , , , , , , и 30 other автор(ы). CoRR, (2024)BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision., , , , , , , , , и 2 other автор(ы). CVPR, стр. 17830-17839. IEEE, (2023)Masked AutoDecoder is Effective Multi-Task Vision Generalist., , , , , и . CoRR, (2024)InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions., , , , , , , , , и 2 other автор(ы). CVPR, стр. 14408-14419. IEEE, (2023)Learning 1D Causal Visual Representation with De-focus Attention Networks., , , , , , , , , и 1 other автор(ы). CoRR, (2024)Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures., , , , , , , , , и . CoRR, (2024)How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites., , , , , , , , , и 25 other автор(ы). CoRR, (2024)Needle In A Multimodal Haystack., , , , , , , , , и 6 other автор(ы). CoRR, (2024)Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information., , , , , , , , , и . CVPR, стр. 15888-15899. IEEE, (2023)