Author of the publication

BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision.

, , , , , , , , , , , and . CVPR, page 17830-17839. IEEE, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Scene as Occupancy., , , , , , , , , and 1 other author(s). ICCV, page 8372-8381. IEEE, (2023)BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision., , , , , , , , , and 2 other author(s). CVPR, page 17830-17839. IEEE, (2023)OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text., , , , , , , , , and 30 other author(s). CoRR, (2024)Masked AutoDecoder is Effective Multi-Task Vision Generalist., , , , , and . CoRR, (2024)InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions., , , , , , , , , and 2 other author(s). CVPR, page 14408-14419. IEEE, (2023)How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites., , , , , , , , , and 25 other author(s). CoRR, (2024)Learning 1D Causal Visual Representation with De-focus Attention Networks., , , , , , , , , and 1 other author(s). CoRR, (2024)Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures., , , , , , , , , and . CoRR, (2024)Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information., , , , , , , , , and . CVPR, page 15888-15899. IEEE, (2023)Needle In A Multimodal Haystack., , , , , , , , , and 6 other author(s). CoRR, (2024)