copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

2013 Fourth International Conference on Advances in Computer Science

D. S, and D. Das (Eds.) (2013)

Abstract

With the rapid growth of information technology and in many business applications, mining frequent patterns and finding associations among them requires handling large and distributed databases. As FP-tree considered being the best compact data structure to hold the data patterns in memory there has been efforts to make it parallel and distributed to handle large databases. However, it incurs lot of communication over head during the mining. In this paper parallel and distributed frequent pattern mining algorithm using Hadoop Map Reduce framework is proposed, which shows best performance results for large databases. Proposed algorithm partitions the database in such a way that, it works independently at each local node and locally generates the frequent patterns by sharing the global frequent pattern header table. These local frequent patterns are merged at final stage. This reduces the complete communication overhead during structure construction as well as during pattern mining. The item set count is also taken into consideration reducing processor idle time. Hadoop Map Reduce framework is used effectively in all the steps of the algorithm. Experiments are carried out on a PC cluster with 5 computing nodes which shows execution time efficiency as compared to other algorithms. The experimental result shows that proposed algorithm efficiently handles the scalability for very large datab ases.

Links and resources

BibTeX key: s2013distributed
entry type: conference
booktitle: Distributed Algorithm for Frequent Pattern Mining using HadoopMap Reduce Framework
year: 2013
organization: Institute of Doctors Engineers and Scientists
publisher: ACEEE (A Computer division of IDES)
Document: http://searchdl.org/public/conference/2013/AETACS/123.pdf

Cite this publication

@conference{s2013distributed, abstract = {With the rapid growth of information technology and in many business applications, mining frequent patterns and finding associations among them requires handling large and distributed databases. As FP-tree considered being the best compact data structure to hold the data patterns in memory there has been efforts to make it parallel and distributed to handle large databases. However, it incurs lot of communication over head during the mining. In this paper parallel and distributed frequent pattern mining algorithm using Hadoop Map Reduce framework is proposed, which shows best performance results for large databases. Proposed algorithm partitions the database in such a way that, it works independently at each local node and locally generates the frequent patterns by sharing the global frequent pattern header table. These local frequent patterns are merged at final stage. This reduces the complete communication overhead during structure construction as well as during pattern mining. The item set count is also taken into consideration reducing processor idle time. Hadoop Map Reduce framework is used effectively in all the steps of the algorithm. Experiments are carried out on a PC cluster with 5 computing nodes which shows execution time efficiency as compared to other algorithms. The experimental result shows that proposed algorithm efficiently handles the scalability for very large datab ases.}, added-at = {2014-02-04T07:55:40.000+0100}, biburl = {https://www.bibsonomy.org/bibtex/233a87810ac28b3e93e4bdd6fa70046e3/idescitation}, booktitle = {Distributed Algorithm for Frequent Pattern Mining using HadoopMap Reduce Framework}, editor = {S, Dr. Harish B and Das, Dr. Vinu V}, interhash = {d0a9093dbf38877dab03dbd8e80a7e99}, intrahash = {33a87810ac28b3e93e4bdd6fa70046e3}, keywords = {Distributed Parallel computing processing}, organization = {Institute of Doctors Engineers and Scientists}, publisher = {ACEEE (A Computer division of IDES)}, timestamp = {2014-02-04T07:55:40.000+0100}, title = {2013 Fourth International Conference on Advances in Computer Science}, url = {http://searchdl.org/public/conference/2013/AETACS/123.pdf}, year = 2013 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

2013 Fourth International Conference on Advances in Computer Science

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML 2013 Fourth International Conference on Advances in Computer Science

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

2013 Fourth International Conference on Advances in Computer Science

Comments and Reviews
(0)