Artikel,

Large scale image annotation: learning to rank with joint word-image embeddings

J. Weston, S. Bengio, und N. Usunier.
Machine Learning, (2010)
DOI: 10.1007/s10994-010-5198-3

Zusammenfassung

Image annotation datasets are becoming larger and larger, with tens of millions of images and tens of thousands of possible annotations. We propose a strongly performing method that scales to such datasets by simultaneously learning to optimize precision at k of the ranked list of annotations for a given image and learning a low-dimensional joint embedding space for both images and annotations. Our method both outperforms several baseline methods and, in comparison to them, is faster and consumes less memory. We also demonstrate how our method learns an interpretable model, where annotations with alternate spellings or even languages are close in the embedding space. Hence, even when our model does not predict the exact annotation given by a human labeler, it often predicts similar annotations, a fact that we try to quantify by measuring the newly introduced sibling precision metric, where our method also obtains excellent results.

BibTeX-Schlüssel: Weston10imageAnnotation
Eintragstyp: article
Jahr: 2010
Zeitschrift: Machine Learning
Seiten: 21-35
Verlag: Springer Netherlands
Band: 81
issn: 0885-6125
issue: 1
keyword: Computer Science
affiliation: Google, New York, USA
DOI: 10.1007/s10994-010-5198-3
URL: http://dx.doi.org/10.1007/s10994-010-5198-3

BibSonomy

Large scale image annotation: learning to rank with joint word-image embeddings

Zusammenfassung

Tags

Nutzer

Kommentare und Rezensionenanzeigen / verbergen

Zitieren Sie diese Publikation

Mehr Zitationsstile

Suchen auf