Author of the publication

Beyond OCR + VQA: Involving OCR into the Flow for Robust and Accurate TextVQA.

, , , and . ACM Multimedia, page 376-385. ACM, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Enabling Natural Human-Computer Interaction Through AI-Powered Nanocomposite IoT Throat Vibration Sensor., , , , , , , , , and . IEEE Internet Things J., 11 (14): 24761-24774 (July 2024)Modeling Scattering Coefficients using Self-Attentive Complex Polynomials with Image-based Representation., , , , , , , , , and . CoRR, (2023)A Cost-Efficient Framework for Scene Text Detection in the Wild., , , and . PRICAI (1), volume 13031 of Lecture Notes in Computer Science, page 139-153. Springer, (2021)MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection., , , , , , , , , and . ICLR, OpenReview.net, (2023)Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length., , , , , , , , , and . CoRR, (2024)Feature-Selection High-Resolution Network With Hypersphere Embedding for Semantic Segmentation of VHR Remote Sensing Images., , , , , and . IEEE Trans. Geosci. Remote. Sens., (2022)Individual Recognition Method of Radiation Source Based on Deep Subdomain Adaptation Network., , , and . ICCT, page 1132-1137. IEEE, (2022)AutoCAT: Reinforcement Learning for Automated Exploration of Cache Timing-Channel Attacks., , , , , , , , and . CoRR, (2022)Masked and Permuted Implicit Context Learning for Scene Text Recognition., , , , , , , and . CoRR, (2023)Accurate and Robust Scene Text Recognition via Adversarial Training., , , and . ICASSP, page 4275-4279. IEEE, (2024)