Mahout currently has
Collaborative Filtering
User and Item based recommenders
K-Means, Fuzzy K-Means clustering
Mean Shift clustering
Dirichlet process clustering
Latent Dirichlet Allocation
Singular value decomposition
Parallel Frequent Pattern mining
Complementary Naive Bayes classifier
Random forest decision tree based classifier
High performance java collections (previously colt collections)
A vibrant community
and many more cool stuff to come by this summer thanks to Google summer of code
I'm interested in machine learning techniques (graphical models, kernel methods) applied to text understanding (entity and relation extraction, coreference resolution, document classification and clustering, confidence prediction, social network analysis, data mining).
R. Kohavi. Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, Seite 1137-1145. San Mateo, CA: Morgan Kaufmann, (1995)
D. Nguyen, N. Smith, und C. Rosé. Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, Seite 115--123. Stroudsburg, PA, USA, Association for Computational Linguistics, (2011)