Author of the publication

Informedia @ TRECVID 2018: Ad-hoc Video Search, Video to Text Description, Activities in Extended video.

, , , , , , , , , , , , , , , , , , and . TRECVID, National Institute of Standards and Technology (NIST), (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Learning Distributional Representation and Set Distance for Multi-shot Person Re-identification., , and . CoRR, (2018)RISE Video Dataset: Recognizing Industrial Smoke Emissions., , , , , , , , and . CoRR, (2020)Ensemble environment modeling using affine transform group., , , and . Speech Commun., (2015)Multi-Metrics Learning for Speech Enhancement., , , and . CoRR, (2017)SapAugment: Learning A Sample Adaptive Policy for Data Augmentation., , , , , , , and . ICASSP, page 4040-4044. IEEE, (2021)Discriminative Fuzzy Clustering Maximum a Posterior Linear Regression for Speaker Adaptation., , and . INTERSPEECH, page 567-570. ISCA, (2012)Ensemble of machine learning and acoustic segment model techniques for speech emotion and autism spectrum disorders recognition., , , , , , and . INTERSPEECH, page 215-219. ISCA, (2013)Learning of context-aware single image super-resolution., , , and . VCIP, page 1-4. IEEE, (2011)Automatic Transcription for Music with Two Timbres from Monaural Sound Source., , and . ISM, page 314-317. IEEE Computer Society, (2010)Unsupervised Style and Content Separation by Minimizing Mutual Information for Speech Synthesis., , , and . ICASSP, page 3267-3271. IEEE, (2020)