Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A Multi-Modal Fusion Approach for Audio-Visual Scene Classification Enhanced by CLIP Variants.

S. Okazaki, Q. Kong, and T. Yoshinaga. DCASE, page 95-99. (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Tomoaki Fukai

Other publications of authors with the same name

Fast Method for Face Detection Based on the Characteristic of Cascade Classifier.T. Yoshinaga, S. Nagaya, and I. Karube. IPSJ Trans. Comput. Vis. Appl., (2011)Multi-Stream Adaptive Graph Convolutional Network Using Inter- and Intra-Body Graphs for Two-Person Interaction Recognition.Y. Ito, K. Morita, Q. Kong, and T. Yoshinaga. IEEE Access, (2021)Audio-Visual Speech Recognition Using Lip Information Extracted from Side-Face Images.K. Iwano, T. Yoshinaga, S. Tamura, and S. Furui. EURASIP J. Audio Speech Music. Process., (2007)Hierarchical contrastive adaptation for cross-domain object detection.Z. Deng, Q. Kong, N. Akira, and T. Yoshinaga. Mach. Vis. Appl., 33 (4): 62 (2022)NII Hitachi UIT at TRECVID 2019.M. Klinkigt, D. Le, A. Hiroike, H. Vo, M. Chabra, V. Dang, Q. Kong, V. Nguyen, T. Murakami, T. Do and 10 other author(s). TRECVID, National Institute of Standards and Technology (NIST), (2019)QPIC: Query-Based Pairwise Human-Object Interaction Detection With Image-Wide Contextual Information.M. Tamura, H. Ohashi, and T. Yoshinaga. CVPR, page 10410-10419. Computer Vision Foundation / IEEE, (2021)Segmentation-Based Bounding Box Generation for Omnidirectional Pedestrian Detection.M. Tamura, and T. Yoshinaga. CoRR, (2021)BCaR: Beginner Classifier as Regularization Towards Generalizable Re-ID.M. Tamura, and T. Yoshinaga. BMVC, BMVA Press, (2020)Hitachi at TRECVID DSDI 2020.S. Okazaki, Q. Kong, M. Klinkigt, and T. Yoshinaga. TRECVID, National Institute of Standards and Technology (NIST), (2020)Cycle-Contrast for Self-Supervised Video Representation Learning.Q. Kong, W. Wei, Z. Deng, T. Yoshinaga, and T. Murakami. NeurIPS, (2020)

BibSonomy

Disambiguation of "Yoshinaga, Tomoaki"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A Multi-Modal Fusion Approach for Audio-Visual Scene Classification Enhanced by CLIP Variants.

Please choose a person to relate this publication to

Tomoaki Fukai

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Yoshinaga, Tomoaki"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML A Multi-Modal Fusion Approach for Audio-Visual Scene Classification Enhanced by CLIP Variants.

Please choose a person to relate this publication to

Tomoaki Fukai

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A Multi-Modal Fusion Approach for Audio-Visual Scene Classification Enhanced by CLIP Variants.