Author of the publication

Attention-Based Cross-Modal Fusion for Audio-Visual Voice Activity Detection in Musical Video Streams.

, , , , , , and . Interspeech, page 321-325. ISCA, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A novel audio fingerprinting method robust to time scale modification and pitch shifting., , , and . ACM Multimedia, page 987-990. ACM, (2010)Latent time-frequency component analysis: A novel pitch-based approach for singing voice separation., , and . ICASSP, page 131-135. IEEE, (2015)Robust hashing for music copyright protection by combining beat segmentation and chroma., , , and . ACM Multimedia, page 935-938. ACM, (2010)S3T: Self-Supervised Pre-Training with Swin Transformer For Music Classification., , , , and . ICASSP, page 606-610. IEEE, (2022)Bytecover2: Towards Dimensionality Reduction of Latent Embedding for Efficient Cover Song Identification., , , , and . ICASSP, page 616-620. IEEE, (2022)Vocal Melody Extraction via DNN-based Pitch Estimation and Salience-based Pitch Refinement., , , , , and . ICASSP, page 1000-1004. IEEE, (2019)Towards Solving the Bottleneck of Pitch-based Singing Voice Separation., , and . ACM Multimedia, page 511-520. ACM, (2015)Singing Melody Extraction from Polyphonic Music based on Spectral Correlation Modeling., , , and . ICASSP, page 241-245. IEEE, (2021)Bytecover3: Accurate Cover Song Identification On Short Queries., , , , , and . ICASSP, page 1-5. IEEE, (2023)On the music content authentication., , and . ACM Multimedia, page 1101-1104. ACM, (2012)