A semantic case-based reasoning framework for text categorization
V. Ceausu, and S. Desprès. Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007), Busan, South Korea, volume 4825 of LNCS, page 729--742. Berlin, Heidelberg, Springer Verlag, (November 2007)
Abstract
This paper presents a semantic case-based reasoning framework for text categorization. Text categorization is the task of classifying text documents under predened categories. Accidentology is our application eld and the goal of our framework is to classify documents describing real road accidents under predened road accident prototpypes, which also are described by text documents. Accidents are described by accident reports while accident prototypes are described by accident scenarios. Thus, text categorization is done by assigning each accident report to an accident scenario, which highlights particular mechanisms leading to accident.
We propose a textual case based reasoning approach (TCBR), which allows us to integrate both textual and domain knowledge aspects inorder to carry out this categorization. CBR solves a new problem (target case) by identifying its similarity to one or several previously solved problems (source cases) stored in a case base and by adapting their known solutions. Cases of our framework are created from text. Most of TCBR applications create cases from text by using Information Retrieval techniques, which leads to knowledge-poor descriptions of cases. We show that using semantic resources (two ontology of accidentology) makes possible to overcome this diculty, and allows us to enrich cases by using formal knowledge.
In this paper, we argue that semantic resources are likely to improve the quality of cases created from text, and, therefore, such resources can support the reasoning cycle. We illustrate this claim with our framework developed to classify documents in the accidentology domain.
%0 Conference Paper
%1 Ceausu/2007/semantic
%A Ceausu, Valentina
%A Desprès, Sylvie
%B Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007), Busan, South Korea
%C Berlin, Heidelberg
%D 2007
%E Aberer, Karl
%E Choi, Key-Sun
%E Noy, Natasha
%E Allemang, Dean
%E Lee, Kyung-Il
%E Nixon, Lyndon J B
%E Golbeck, Jennifer
%E Mika, Peter
%E Maynard, Diana
%E Schreiber, Guus
%E Cudré-Mauroux, Philippe
%I Springer Verlag
%K 2007 categorization framework in_use_4 iswc reasoning semantic text
%P 729--742
%T A semantic case-based reasoning framework for text categorization
%U http://iswc2007.semanticweb.org/papers/729.pdf
%V 4825
%X This paper presents a semantic case-based reasoning framework for text categorization. Text categorization is the task of classifying text documents under predened categories. Accidentology is our application eld and the goal of our framework is to classify documents describing real road accidents under predened road accident prototpypes, which also are described by text documents. Accidents are described by accident reports while accident prototypes are described by accident scenarios. Thus, text categorization is done by assigning each accident report to an accident scenario, which highlights particular mechanisms leading to accident.
We propose a textual case based reasoning approach (TCBR), which allows us to integrate both textual and domain knowledge aspects inorder to carry out this categorization. CBR solves a new problem (target case) by identifying its similarity to one or several previously solved problems (source cases) stored in a case base and by adapting their known solutions. Cases of our framework are created from text. Most of TCBR applications create cases from text by using Information Retrieval techniques, which leads to knowledge-poor descriptions of cases. We show that using semantic resources (two ontology of accidentology) makes possible to overcome this diculty, and allows us to enrich cases by using formal knowledge.
In this paper, we argue that semantic resources are likely to improve the quality of cases created from text, and, therefore, such resources can support the reasoning cycle. We illustrate this claim with our framework developed to classify documents in the accidentology domain.
@inproceedings{Ceausu/2007/semantic,
abstract = {This paper presents a semantic case-based reasoning framework for text categorization. Text categorization is the task of classifying text documents under predened categories. Accidentology is our application eld and the goal of our framework is to classify documents describing real road accidents under predened road accident prototpypes, which also are described by text documents. Accidents are described by accident reports while accident prototypes are described by accident scenarios. Thus, text categorization is done by assigning each accident report to an accident scenario, which highlights particular mechanisms leading to accident.
We propose a textual case based reasoning approach (TCBR), which allows us to integrate both textual and domain knowledge aspects inorder to carry out this categorization. CBR solves a new problem (target case) by identifying its similarity to one or several previously solved problems (source cases) stored in a case base and by adapting their known solutions. Cases of our framework are created from text. Most of TCBR applications create cases from text by using Information Retrieval techniques, which leads to knowledge-poor descriptions of cases. We show that using semantic resources (two ontology of accidentology) makes possible to overcome this diculty, and allows us to enrich cases by using formal knowledge.
In this paper, we argue that semantic resources are likely to improve the quality of cases created from text, and, therefore, such resources can support the reasoning cycle. We illustrate this claim with our framework developed to classify documents in the accidentology domain.},
added-at = {2007-11-07T19:13:58.000+0100},
address = {Berlin, Heidelberg},
author = {Ceausu, Valentina and Desprès, Sylvie},
biburl = {https://www.bibsonomy.org/bibtex/2d2d06a40d5dc20da24b6ae5e20073ec5/iswc2007},
booktitle = {Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007), Busan, South Korea},
crossref = {http://data.semanticweb.org/conference/iswc-aswc/2007/proceedings},
editor = {Aberer, Karl and Choi, Key-Sun and Noy, Natasha and Allemang, Dean and Lee, Kyung-Il and Nixon, Lyndon J B and Golbeck, Jennifer and Mika, Peter and Maynard, Diana and Schreiber, Guus and Cudré-Mauroux, Philippe},
interhash = {fbf773aa9dd1fe58732aa8f7963900e3},
intrahash = {d2d06a40d5dc20da24b6ae5e20073ec5},
keywords = {2007 categorization framework in_use_4 iswc reasoning semantic text},
month = {November},
pages = {729--742},
publisher = {Springer Verlag},
series = {LNCS},
timestamp = {2007-11-07T19:20:54.000+0100},
title = {A semantic case-based reasoning framework for text categorization},
url = {http://iswc2007.semanticweb.org/papers/729.pdf},
volume = 4825,
year = 2007
}