Author of the publication

A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation.

ECCV (10), volume 13670 of Lecture Notes in Computer Science, pages 711-727. Springer, (2022)

Other publications of authors with the same name

Revisiting Neural Scaling Laws in Language and Vision. NeurIPS, (2022)
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. ICLR, OpenReview.net, (2021)
A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation. ECCV (10), volume 13670 of Lecture Notes in Computer Science, pages 711-727. Springer, (2022)
PKU_ICST at TRECVID2013: Instance Search Task. TRECVID, National Institute of Standards and Technology (NIST), (2013)
Self-Supervised Generative Adversarial Networks. CoRR, (2018)
The Visual Task Adaptation Benchmark. (2019). arXiv:1910.04867.
Adaptive Control Based on Recurrent Fuzzy Wavelet Neural Network and Its Application on Robotic Tracking Control. ISNN (2), volume 3972 of Lecture Notes in Computer Science, pages 1166-1171. Springer, (2006)
PaliGemma 2: A Family of Versatile VLMs for Transfer. CoRR, (2024)
On Scaling Up a Multilingual Vision and Language Model. CVPR, pages 14432-14444. IEEE, (2024)
High-Fidelity Image Generation With Fewer Labels. ICML, volume 97 of Proceedings of Machine Learning Research, pages 4183-4192. PMLR, (2019)