Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions.

R. Tanaka, T. Iki, K. Nishida, K. Saito, and J. Suzuki. AAAI, page 19071-19079. AAAI Press, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Masao Tanaka

Masafumi Tanaka

Ryota Gemma

Shu Tanaka

Other publications of authors with the same name

How Well Do Vision Models Encode Diagram Attributes?H. Yoshida, K. Kudo, Y. Aoki, R. Tanaka, I. Saito, K. Sakaguchi, and K. Inui. ACL (Student Research Workshop), page 564-575. Association for Computational Linguistics, (2024)InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions.R. Tanaka, T. Iki, K. Nishida, K. Saito, and J. Suzuki. AAAI, page 19071-19079. AAAI Press, (2024)Empirical Analysis of Large Vision-Language Models against Goal Hijacking via Visual Prompt Injection.S. Kimura, R. Tanaka, S. Miyawaki, J. Suzuki, and K. Sakaguchi. CoRR, (2024)Different Modal Stereo: Simultaneous Estimation of Stereo Image Disparity and Modality Translation.R. Tanaka, F. Sakaue, and J. Sato. VISIGRAPP (4: VISAPP), page 554-560. SCITEPRESS, (2020)VisualMRC: Machine Reading Comprehension on Document Images.R. Tanaka, K. Nishida, and S. Yoshida. AAAI, page 13878-13888. AAAI Press, (2021)3D Pose-Based Temporal Action Segmentation for Figure Skating: A Fine-Grained and Jump Procedure-Aware Annotation Approach.R. Tanaka, T. Suzuki, and K. Fujii. MMSports@MM, page 17-26. ACM, (2024)Automatic Edge Error Judgment in Figure Skating Using 3D Pose Estimation from Inertial Sensors.R. Tanaka, T. Suzuki, K. Takeda, and K. Fujii. GCCE, page 1099-1100. IEEE, (2023)Pseudo-label based unsupervised fine-tuning of a monocular 3D pose estimation model for sports motions.T. Suzuki, R. Tanaka, K. Takeda, and K. Fujii. CVPR Workshops, page 3315-3324. IEEE, (2024)SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images.R. Tanaka, K. Nishida, K. Nishida, T. Hasegawa, I. Saito, and K. Saito. AAAI, page 13636-13645. AAAI Press, (2023)Automatic Edge Error Judgment in Figure Skating Using 3D Pose Estimation from a Monocular Camera and IMUs.R. Tanaka, T. Suzuki, K. Takeda, and K. Fujii. MMSports@MM, page 41-48. ACM, (2023)

BibSonomy

Disambiguation of "Tanaka, Ryota"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions.

Please choose a person to relate this publication to

Masao Tanaka

Masafumi Tanaka

Ryota Gemma

Ryota Gemma

Shu Tanaka

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Tanaka, Ryota"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions.

Please choose a person to relate this publication to

Masao Tanaka

Masafumi Tanaka

Ryota Gemma

Ryota Gemma

Shu Tanaka

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions.