Following up on KMeans Clustering Now Running on Elastic MapReduce, Stephen Green has generously documented, on the Apache Lucene Mahout wiki, the steps that were necessary to get an example of k-Means clustering up and running on Amazon’s Elastic MapReduce (EMR).
S. Basu, A. Banerjee, and R. Mooney. Proceedings of the 2004 SIAM International Conference on Data Mining, pp. 333--344. Lake Buena Vista, FL, Society for Industrial and Applied Mathematics (April 2004).