Inproceedings,

Attention-Based Cross-Modal Fusion for Audio-Visual Voice Activity Detection in Musical Video Streams.

, , , , , , and .
Interspeech, pages 321-325. ISCA, (2021)
