Author of the publication

Multimodal and Multiresolution Speech Recognition with Transformers.

, , , and . ACL, page 2381-2387. Association for Computational Linguistics, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Multimodal and Multiresolution Speech Recognition with Transformers., , , and . ACL, page 2381-2387. Association for Computational Linguistics, (2020)ADMM-DAD Net: A Deep Unfolding Network for Analysis Compressed Sensing., , , and . ICASSP, page 1506-1510. IEEE, (2022)Regotron: Regularizing the Tacotron2 Architecture Via Monotonic Alignment Loss., , , , , and . SLT, page 977-983. IEEE, (2022)NTUA-SLP at SemEval-2018 Task 1: Predicting Affective Content in Tweets with Deep Attentive RNNs and Transfer Learning., , , , , , , and . SemEval@NAACL-HLT, page 245-255. Association for Computational Linguistics, (2018)Meltemi: The first open Large Language Model for Greek., , , , , , , , and . CoRR, (2024)The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data., , , and . CoRR, (2024)NTUA-SLP at SemEval-2018 Task 3: Tracking Ironic Tweets using Ensembles of Word and Character Level Attentive RNNs., , , , , , and . SemEval@NAACL-HLT, page 613-621. Association for Computational Linguistics, (2018)Integrating Recurrence Dynamics for Speech Emotion Recognition., , , and . INTERSPEECH, page 927-931. ISCA, (2018)Weakly-supervised forced alignment of disfluent speech using phoneme-level modeling., , , and . INTERSPEECH, page 1563-1567. ISCA, (2023)M3: MultiModal Masking Applied to Sentiment Analysis., , and . Interspeech, page 2876-2880. ISCA, (2021)