Abstract
Today, among the best-performing audio-based music simi-
larity measures are algorithms based on Mel Frequency Cepstrum Coef-
ficients (MFCCs). In these algorithms, each music track is modelled as a
Gaussian Mixture Model (GMM) of MFCCs. The similarity between two
tracks is computed by comparing their GMMs. One drawback of this ap-
proach is that the distance space obtained this way has some undesirable
properties.
In this paper, a number of approaches to correct these undesirable prop-
erties are investigated. They use knowledge about the properties of music
by using other music tracks as a reference. These reference tracks can
either be the music collection itself, or they may be an external set of
reference tracks.
Our results show that the proposed techniques clearly improve the qual-
ity of this audio similarity measure. Furthermore, preliminary experi-
ments indicate that the techniques also help to improve other similarity
measures. They may even be useful in completely different domains, most
notably text information retrieval.
Users
Please
log in to take part in the discussion (add own reviews or comments).