Mahout currently has
Collaborative Filtering
User and Item based recommenders
K-Means, Fuzzy K-Means clustering
Mean Shift clustering
Dirichlet process clustering
Latent Dirichlet Allocation
Singular value decomposition
Parallel Frequent Pattern mining
Complementary Naive Bayes classifier
Random forest decision tree based classifier
High performance java collections (previously colt collections)
A vibrant community
and many more cool stuff to come by this summer thanks to Google summer of code
I'm interested in machine learning techniques (graphical models, kernel methods) applied to text understanding (entity and relation extraction, coreference resolution, document classification and clustering, confidence prediction, social network analysis, data mining).
R. Kohavi. Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, стр. 1137-1145. San Mateo, CA: Morgan Kaufmann, (1995)
D. Nguyen, N. Smith, и C. Rosé. Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, стр. 115--123. Stroudsburg, PA, USA, Association for Computational Linguistics, (2011)
W. Lee. ICML '01: Proceedings of the Eighteenth International Conference on Machine Learning, стр. 314--321. San Francisco, CA, USA, Morgan Kaufmann Publishers Inc., (2001)
P. Cimiano, A. Hotho, и S. Staab. Proceedings of the European Conference on Artificial Intelligence (ECAI'04), стр. 435-439. Valencia, Spain, IOS Press, (2004)
L. Schmidt-Thieme. Proceedings of the 5th IEEE International Conference on Data Mining (ICDM 2005), 27-30 November 2005, стр. 378-385. Houston, Texas, USA, IEEE Computer Society, (2005)
A. Plangprasopchok, и K. Lerman. WWW '09: Proceedings of the 18th international conference on World wide web, стр. 781--790. New York, NY, USA, ACM, (2009)
P. Turney. ACL-44: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, стр. 313--320. Morristown, NJ, USA, Association for Computational Linguistics, (2006)
G. Wu, E. Chang, и N. Panda. KDD '05: Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, стр. 703--709. New York, NY, USA, ACM Press, (2005)