copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Discovering objects and their location in images

J. Sivic, B. Russell, A. Efros, A. Zisserman, and W. Freeman. 1, page 370--377. (2005)

Abstract

We seek to discover the object categories depicted in a set of unlabelled images. We achieve this using a model developed in the statistical text literature: probabilistic latent semantic analysis (pLSA). In text analysis, this is used to discover topics in a corpus using the bag-of-words document representation. Here we treat object categories as topics, so that an image containing instances of several categories is modeled as a mixture of topics. The model is applied to images by using a visual analogue of a word, formed by vector quantizing SIFT-like region descriptors. The topic discovery approach successfully translates to the visual domain: for a small set of objects, we show that both the object categories and their approximate spatial layout are found without supervision. Performance of this unsupervised method is compared to the supervised approach of Fergus et al. (2003) on a set of unseen images containing only one object per image. We also extend the bag-of-words vocabulary to include 'doublets' which encode spatially local co-occurring regions. It is demonstrated that this extended vocabulary gives a cleaner image segmentation. Finally, the classification and segmentation methods are applied to a set of images containing multiple objects per image. These results demonstrate that we can successfully build object class models from an unsupervised analysis of images.

Links and resources

BibTeX key: citeulike:1321552
entry type: inproceedings
year: 2005
journal: Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on
pages: 370--377
volume: 1
priority: 3
citeulike-article-id: 1321552
url: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1541280

Cite this publication

@inproceedings{citeulike:1321552, abstract = {We seek to discover the object categories depicted in a set of unlabelled images. We achieve this using a model developed in the statistical text literature: probabilistic latent semantic analysis (pLSA). In text analysis, this is used to discover topics in a corpus using the bag-of-words document representation. Here we treat object categories as topics, so that an image containing instances of several categories is modeled as a mixture of topics. The model is applied to images by using a visual analogue of a word, formed by vector quantizing SIFT-like region descriptors. The topic discovery approach successfully translates to the visual domain: for a small set of objects, we show that both the object categories and their approximate spatial layout are found without supervision. Performance of this unsupervised method is compared to the supervised approach of Fergus et al. (2003) on a set of unseen images containing only one object per image. We also extend the bag-of-words vocabulary to include 'doublets' which encode spatially local co-occurring regions. It is demonstrated that this extended vocabulary gives a cleaner image segmentation. Finally, the classification and segmentation methods are applied to a set of images containing multiple objects per image. These results demonstrate that we can successfully build object class models from an unsupervised analysis of images.}, added-at = {2007-05-29T08:50:30.000+0200}, author = {Sivic, J. and Russell, B. C. and Efros, A. A. and Zisserman, A. and Freeman, W. T.}, biburl = {https://www.bibsonomy.org/bibtex/2d93f5d020b4880e60f1c2920217c5c7e/kzhou}, citeulike-article-id = {1321552}, interhash = {b8449985743cc4fcd1214adf084d359a}, intrahash = {d93f5d020b4880e60f1c2920217c5c7e}, journal = {Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on}, keywords = {image_classification learning part_based_analysis}, pages = {370--377}, priority = {3}, timestamp = {2007-05-29T08:50:30.000+0200}, title = {Discovering objects and their location in images}, url = {http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1541280}, volume = 1, year = 2005 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Discovering objects and their location in images

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Discovering objects and their location in images

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Discovering objects and their location in images

Comments and Reviews
(0)