Article,

MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation.

, , , , , , , , , , , , , , and .
CoRR, (2024)

Meta data

Tags

Users

  • @dblp

Comments and Reviews