Following up on "KMeans Clustering Now Running on Elastic MapReduce", Stephen Green has generously documented, on the Apache Lucene Mahout wiki, the steps that were necessary to get an example of k-Means clustering up and running on Amazon’s Elastic MapReduce (EMR).
A. Hotho, A. Maedche, and S. Staab. ICDM '01: Proceedings of the 2001 IEEE International Conference on Data Mining, pp. 607--608. Washington, DC, USA, IEEE Computer Society, (2001)
J. MacQueen. Proc. of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1, pp. 281--297. University of California Press, (1967)
I. Yoo and X. Hu. JCDL '06: Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 220--229. New York, NY, USA, ACM Press, (2006)
Y. Zhao and G. Karypis. CIKM '02: Proceedings of the Eleventh International Conference on Information and Knowledge Management, pp. 515--524. New York, NY, USA, ACM Press, (2002)