Author of the publication

Informedia @ TRECVID 2018: Ad-hoc Video Search, Video to Text Description, Activities in Extended video.

, , , , , , , , , , , , , , , , , , and . TRECVID, National Institute of Standards and Technology (NIST), (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Multi-Metrics Learning for Speech Enhancement., , , and . CoRR, (2017)Ensemble environment modeling using affine transform group., , , and . Speech Commun., (2015)SapAugment: Learning A Sample Adaptive Policy for Data Augmentation., , , , , , , and . ICASSP, page 4040-4044. IEEE, (2021)Discriminative Fuzzy Clustering Maximum a Posterior Linear Regression for Speaker Adaptation., , and . INTERSPEECH, page 567-570. ISCA, (2012)Ensemble of machine learning and acoustic segment model techniques for speech emotion and autism spectrum disorders recognition., , , , , , and . INTERSPEECH, page 215-219. ISCA, (2013)Subspace Representation Learning for Few-shot Image Classification., , and . CoRR, (2021)Pose Guided Person Image Generation With Hidden P-Norm Regression., and . ICIP, page 2423-2427. IEEE, (2021)SYNT++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition., , , , , and . ICASSP, page 7682-7686. IEEE, (2022)I See What You Hear: A Vision-Inspired Method to Localize Words., , , , , , , and . ICASSP, page 1-5. IEEE, (2023)Project RISE: Recognizing Industrial Smoke Emissions., , , , , , , , , and . AAAI, page 14813-14821. AAAI Press, (2021)