Author of the publication

End-to-end audio-scene classification from raw audio: Multi time-frequency resolution CNN architecture for efficient representation learning.

, , , and . SPCOM, page 1-5. IEEE, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence., , , and . INTERSPEECH, page 4518-4522. ISCA, (2023)An Investigation of the Virtual Lip Trajectories During the Production of Bilabial Stops and Nasal at Different Speaking Rates., and . INTERSPEECH, page 1401-1405. ISCA, (2020)SPIRE VCV: An Acoustic-Articulatory Corpus with Three Different Speaking Rates., , , , and . O-COCOSDA, page 116-121. IEEE, (2021)End-to-end audio-scene classification from raw audio: Multi time-frequency resolution CNN architecture for efficient representation learning., , , and . SPCOM, page 1-5. IEEE, (2020)Towards Learning Emotion Information from Short Segments of Speech., , , , and . ICASSP, page 1-5. IEEE, (2023)Impact of Speaking Rate on the Source Filter Interaction in Speech: A Study., , and . ICASSP, page 6448-6452. IEEE, (2021)Comparing Biosignal and Acoustic feature Representation for Continuous Emotion Recognition., , , , and . MuSe @ ACM Multimedia, page 37-45. ACM, (2022)Component-specific temporal decomposition: application to enhanced speech coding and co-articulation analysis., and . SPCOM, page 1-5. IEEE, (2020)Implicit phonetic information modeling for speech emotion recognition., , and . INTERSPEECH, page 1883-1887. ISCA, (2023)