Following up on KMeans Clustering Now Running on Elastic MapReduce, Stephen Green has generously documented the steps that was necessary to get an example of k-Means clustering up and running on Amazon’s Elastic MapReduce (EMR) on the Apache Lucene Mahout wiki.
S. Basu, A. Banerjee, und R. Mooney. Proceedings of the 2004 SIAM International Conference on Data Mining, Seite 333--344. Lake Buena Vista, FL, Society for Industrial and Applied Mathematics, (April 2004)
A. Hotho, A. Maedche, und S. Staab. ICDM '01: Proceedings of the 2001 IEEE International Conference on Data Mining, Seite 607--608. Washington, DC, USA, IEEE Computer Society, (2001)
A. Hotho, A. Maedche, und S. Staab. ICDM '01: Proceedings of the 2001 IEEE International Conference on Data Mining, Seite 607--608. Washington, DC, USA, IEEE Computer Society, (2001)
D. Cutting, D. Karger, J. Pedersen, und J. Tukey. SIGIR '92: Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval, Seite 318--329. New York, NY, USA, ACM Press, (1992)
J. MacQueen. Proc. of the fifth Berkeley Symposium on Mathematical Statistics and Probability, 1, Seite 281-297. University of California Press, (1967)
D. Arthur, und S. Vassilvitskii. SODA '07: Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms, Seite 1027--1035. Philadelphia, PA, USA, Society for Industrial and Applied Mathematics, (2007)
D. Arthur, und S. Vassilvitskii. SODA '07: Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms, Seite 1027--1035. Philadelphia, PA, USA, Society for Industrial and Applied Mathematics, (2007)
A. Hotho, A. Maedche, und S. Staab. ICDM '01: Proceedings of the 2001 IEEE International Conference on Data Mining, Seite 607--608. Washington, DC, USA, IEEE Computer Society, (2001)
I. Yoo, und X. Hu. JCDL '06: Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries, Seite 220--229. New York, NY, USA, ACM Press, (2006)
Y. Zhao, und G. Karypis. CIKM '02: Proceedings of the eleventh international conference on Information and knowledge management, Seite 515--524. New York, NY, USA, ACM Press, (2002)