Multi-modal information retrieval from broadcast video
using OCR and speech recognition
A. Hauptmann, R. Jin, and T. Ng. JCDL'02: Proceedings of the 2nd ACM/IEEE-CS Joint
Conference on Digital Libraries, pages 160--161. (2002)
Abstract
We examine multi-modal information retrieval from
broadcast video where text can be read on the screen
through OCR and speech recognition can be performed on
the audio track. OCR and speech recognition are
compared on the 2001 TREC Video Retrieval evaluation
corpus. Results show that OCR is more important than
speech recognition for video retrieval. OCR retrieval
can be further improved through dictionary-based
post-processing. We demonstrate how to utilize
imperfect multi-modal metadata results to benefit
multi-modal information retrieval.
@inproceedings{HJN02,
abstract = {We examine multi-modal information retrieval from
broadcast video where text can be read on the screen
through OCR and speech recognition can be performed on
the audio track. OCR and speech recognition are
compared on the 2001 TREC Video Retrieval evaluation
corpus. Results show that OCR is more important than
speech recognition for video retrieval. OCR retrieval
can be further improved through dictionary-based
post-processing. We demonstrate how to utilize
imperfect multi-modal metadata results to benefit
multi-modal information retrieval.},
added-at = {2006-07-31T15:48:59.000+0200},
author = {Hauptmann, Alexander G. and Jin, Rong and Ng, Tobun Dorbin},
biburl = {https://www.bibsonomy.org/bibtex/21ff9e1cd7b2ef9e71f85288856d3c05d/lysander07},
booktitle = {JCDL'02: Proceedings of the 2nd ACM/IEEE-CS Joint
Conference on Digital Libraries},
interhash = {b0f2413f72ec04b1326c1e93cc4e5737},
intrahash = {1ff9e1cd7b2ef9e71f85288856d3c05d},
keywords = {annotation multimedia},
mrnumber = {C.DL.02.160},
pages = {160--161},
series = {Video and multimedia digital libraries},
timestamp = {2009-01-27T15:24:50.000+0100},
title = {Multi-modal information retrieval from broadcast video
using {OCR} and speech recognition},
url = {http://doi.acm.org/10.1145/544220.544252},
year = 2002
}