Inproceedings,

Automatically Adapting the Structure of Audio Similarity Spaces

T. Pohle, M. Schedl, P. Knees, and G. Widmer.
Proceedings of the 1st Workshop on Learning the Semantics of Audio Signals, Athens, Greece, (2006)

Abstract

Today, among the best-performing audio-based music simi- larity measures are algorithms based on Mel Frequency Cepstrum Coef- ficients (MFCCs). In these algorithms, each music track is modelled as a Gaussian Mixture Model (GMM) of MFCCs. The similarity between two tracks is computed by comparing their GMMs. One drawback of this ap- proach is that the distance space obtained this way has some undesirable properties. In this paper, a number of approaches to correct these undesirable prop- erties are investigated. They use knowledge about the properties of music by using other music tracks as a reference. These reference tracks can either be the music collection itself, or they may be an external set of reference tracks. Our results show that the proposed techniques clearly improve the qual- ity of this audio similarity measure. Furthermore, preliminary experi- ments indicate that the techniques also help to improve other similarity measures. They may even be useful in completely different domains, most notably text information retrieval.

BibTeX key: Pohle:LSAS2006
entry type: inproceedings
address: Athens, Greece
booktitle: Proceedings of the 1st Workshop on Learning the Semantics of Audio Signals
year: 2006
location: Athens, Greece

BibSonomy

Automatically Adapting the Structure of Audio Similarity Spaces

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on