Author of the publication

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization.

, , , , , and . CoRR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

An FPGA-based tree crown detection approach for remote sensing images., , , and . FPT, page 231-234. IEEE, (2017)Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization., , , , , and . CoRR, (2023)DropQueries: A Simple Way to Discover Comprehensive Segment Representations., , , , , , and . IEEE Trans. Multim., (2024)Navigating the Data Trading Crossroads: An Interdisciplinary Survey., , , , , , , , , and . CoRR, (2024)UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios., , , , , , , , , and . CoRR, (2024)MMBench: Is Your Multi-modal Model an All-Around Player?, , , , , , , , , and 2 other author(s). ECCV (6), volume 15064 of Lecture Notes in Computer Science, page 216-233. Springer, (2024)MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models., , , , , , , , , and . CoRR, (2024)SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models., , , , , , , , , and 9 other author(s). ICML, OpenReview.net, (2024)How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites., , , , , , , , , and 25 other author(s). CoRR, (2024)InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output., , , , , , , , , and 17 other author(s). CoRR, (2024)