Abstract
This paper presents an automatic system for structuring and
preparing a news broadcast for applications such as speech summarization,
browsing, archiving and information retrieval. This
process comprises transcribing the audio using an automatic
speech recognizer and subsequently segmenting the text into utterances
and topics. A maximum entropy approach is used to build
statistical models for both utterance and topic segmentation. The
experimental work addresses the effect on performance of the topic
boundary detector of three factors: the information sources used,
the quality of the ASR transcripts, and the quality of the utterance
boundary detector. The results show that the topic segmentation
is not affected severely by transcripts errors, whereas errors in the
utterance segmentation are more devastating.
Users
Please
log in to take part in the discussion (add own reviews or comments).