Data Reduction Method for Categorical Data Clustering

Abstract

Categorical data clustering is an important part of data mining, and its relevance has recently drawn attention from several researchers. As a step in data mining, however, clustering must contend with large amounts of data to be processed. This article offers a solution for categorical clustering algorithms working with high volumes of data: a method that summarizes the database using a structure called the CM-tree. To test the method, the K-Modes and Click clustering algorithms were run on several databases. The experiments demonstrate that the proposed summarization method improves execution time without losing clustering quality.
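
The abstract does not describe how the CM-tree is built, so the following is only a minimal sketch of the general idea of summarizing categorical data before clustering, not the authors' method. It assumes a much simpler stand-in technique: identical records are collapsed into (record, count) pairs, and a K-Modes-style cluster mode is then computed from the weighted summary instead of the full data set. The function names and the sample records are hypothetical.

```python
from collections import Counter


def summarize(records):
    """Collapse identical categorical records into (record, count) pairs.

    Assumption: a plain duplicate count stands in for the CM-tree here.
    """
    counts = Counter(tuple(r) for r in records)
    return list(counts.items())


def weighted_mode(summary):
    """Per-attribute mode of a summary, weighting each record by its count."""
    n_attrs = len(summary[0][0])
    tallies = [Counter() for _ in range(n_attrs)]
    for record, count in summary:
        for j, value in enumerate(record):
            tallies[j][value] += count
    return tuple(t.most_common(1)[0][0] for t in tallies)


def matching_dissimilarity(a, b):
    """Simple matching distance used by K-Modes: number of differing attributes."""
    return sum(x != y for x, y in zip(a, b))


# Hypothetical sample data: five records, three of them identical.
records = [
    ("red", "small", "round"),
    ("red", "small", "round"),
    ("red", "large", "round"),
    ("blue", "large", "square"),
    ("red", "small", "round"),
]

summary = summarize(records)   # three distinct records with their counts
mode = weighted_mode(summary)  # mode computed from the summary only
```

Because each summarized record carries its count, the mode computed from the summary equals the mode computed from the full data, while the clustering step iterates over fewer items.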

BibTeX key: Rendon2008
entry type: incollection
booktitle: Adv. Artif. Intell. – IBERAMIA 2008
year: 2008
pages: 143--152
DOI: http://dx.doi.org/10.1007/978-3-540-88309-8_15
url: http://dx.doi.org/10.1007/978-3-540-88309-8_15
