From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption., , , , , , , , , и . CoRR, (2023)Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models., , , , , , , , , и . ECCV (37), том 13697 из Lecture Notes in Computer Science, стр. 638-656. Springer, (2022)Q-ASR: Integer-only Zero-shot Quantization for Efficient Speech Recognition., , , , , , , и . CoRR, (2021)Multitask Vision-Language Prompt Tuning., , , , , , и . WACV, стр. 5644-5655. IEEE, (2024)InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding., , , , , , , , , и . CoRR, (2024)COCO is ÄLL" You Need for Visual Instruction Fine-tuning., , , , и . CoRR, (2024)Image2Point: 3D Point-Cloud Understanding with Pretrained 2D ConvNets., , , , , , , , и . CoRR, (2021)Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning., , , , , , , , , и . CoRR, (2024)Integer-Only Zero-Shot Quantization for Efficient Speech Recognition., , , , , , , , , и . ICASSP, стр. 4288-4292. IEEE, (2022)Multitask Vision-Language Prompt Tuning., , , , , , и . CoRR, (2022)