Author of the publication

A Multi-Modal Fusion Approach for Audio-Visual Scene Classification Enhanced by CLIP Variants.

, , and . DCASE, page 95-99. (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Fast Method for Face Detection Based on the Characteristic of Cascade Classifier., , and . IPSJ Trans. Comput. Vis. Appl., (2011)Multi-Stream Adaptive Graph Convolutional Network Using Inter- and Intra-Body Graphs for Two-Person Interaction Recognition., , , and . IEEE Access, (2021)Audio-Visual Speech Recognition Using Lip Information Extracted from Side-Face Images., , , and . EURASIP J. Audio Speech Music. Process., (2007)Hierarchical contrastive adaptation for cross-domain object detection., , , and . Mach. Vis. Appl., 33 (4): 62 (2022)NII Hitachi UIT at TRECVID 2019., , , , , , , , , and 10 other author(s). TRECVID, National Institute of Standards and Technology (NIST), (2019)QPIC: Query-Based Pairwise Human-Object Interaction Detection With Image-Wide Contextual Information., , and . CVPR, page 10410-10419. Computer Vision Foundation / IEEE, (2021)Segmentation-Based Bounding Box Generation for Omnidirectional Pedestrian Detection., and . CoRR, (2021)BCaR: Beginner Classifier as Regularization Towards Generalizable Re-ID., and . BMVC, BMVA Press, (2020)Hitachi at TRECVID DSDI 2020., , , and . TRECVID, National Institute of Standards and Technology (NIST), (2020)Cycle-Contrast for Self-Supervised Video Representation Learning., , , , and . NeurIPS, (2020)