@claudio.lucchese

Domain and data partitioning for parallel mining of frequent closed itemsets

, , and . ACM-SE 43: Proceedings of the 43rd annual Southeast regional conference, page 250--255. New York, NY, USA, ACM, (2005)
DOI: http://doi.acm.org/10.1145/1167350.1167423

Abstract

In this paper, we propose an algorithm to partition both the search space and the database for the parallel mining of frequent closed itemsets in large databases. The partitioning of the search space is based on splitting the power set lattice of the total item set to two sub-lattices. Conditional databases axe used to partition the large database. The combination of the search space and database partitioning allows parallel processors to mine the frequent closed itemsets independently and thus minimizes the interprocessor communication and synchronization. The partitioning also ensures the load balance among the parallel processors.

Description

Domain and data partitioning for parallel mining of frequent closed itemsets

Links and resources

Tags

community

  • @dblp
  • @claudio.lucchese
@claudio.lucchese's tags highlighted