R. Swan, und J. Allan. Annual ACM Conference on Research and Development in Information Retrieval (SIGIR), (2000)
Zusammenfassung
We present a statistical model of feature occurrence over time, and develop tests based on classical hypothesis testing for significance of term appearance on a given date. Using additional classical hypothesis testing we are able to combine these terms to generate “topics” as defined by the Topic Detection and Tracking study. The groupings of terms obtained can be used to automatically generate an interactive timeline displaying the major events and topics covered by the corpus. To test the validity of our technique we extracted a large number of these topics from a test corpus and had human evaluators judge how well the selected features captured the gist of the topics, and how they overlapped with a set of known topics from the corpus. The resulting topics were highly rated by evaluators who compared them to known topics.
%0 Journal Article
%1 Swan2000
%A Swan, Russell
%A Allan, James
%D 2000
%J Annual ACM Conference on Research and Development in Information Retrieval (SIGIR)
%K UIs\&slash,event and collection data detection erc event_detection for mining,visualization models,statistical\&slash,text overviews tracking,probabilistic
%P 49
%T Automatic generation of overview timelines
%U http://portal.acm.org/citation.cfm?id=345546
%X We present a statistical model of feature occurrence over time, and develop tests based on classical hypothesis testing for significance of term appearance on a given date. Using additional classical hypothesis testing we are able to combine these terms to generate “topics” as defined by the Topic Detection and Tracking study. The groupings of terms obtained can be used to automatically generate an interactive timeline displaying the major events and topics covered by the corpus. To test the validity of our technique we extracted a large number of these topics from a test corpus and had human evaluators judge how well the selected features captured the gist of the topics, and how they overlapped with a set of known topics from the corpus. The resulting topics were highly rated by evaluators who compared them to known topics.
@article{Swan2000,
abstract = {We present a statistical model of feature occurrence over time, and develop tests based on classical hypothesis testing for significance of term appearance on a given date. Using additional classical hypothesis testing we are able to combine these terms to generate “topics” as defined by the Topic Detection and Tracking study. The groupings of terms obtained can be used to automatically generate an interactive timeline displaying the major events and topics covered by the corpus. To test the validity of our technique we extracted a large number of these topics from a test corpus and had human evaluators judge how well the selected features captured the gist of the topics, and how they overlapped with a set of known topics from the corpus. The resulting topics were highly rated by evaluators who compared them to known topics.},
added-at = {2012-09-17T17:48:52.000+0200},
author = {Swan, Russell and Allan, James},
biburl = {https://www.bibsonomy.org/bibtex/2b91824b36000b830b6fdeb1af9261663/lillejul},
interhash = {dd6cec518f1066160b1d973a009dd813},
intrahash = {b91824b36000b830b6fdeb1af9261663},
journal = {Annual ACM Conference on Research and Development in Information Retrieval (SIGIR)},
keywords = {UIs\&slash,event and collection data detection erc event_detection for mining,visualization models,statistical\&slash,text overviews tracking,probabilistic},
pages = 49,
timestamp = {2012-09-17T17:54:01.000+0200},
title = {{Automatic generation of overview timelines}},
url = {http://portal.acm.org/citation.cfm?id=345546},
year = 2000
}