Mahout currently has
Collaborative Filtering
User and Item based recommenders
K-Means, Fuzzy K-Means clustering
Mean Shift clustering
Dirichlet process clustering
Latent Dirichlet Allocation
Singular value decomposition
Parallel Frequent Pattern mining
Complementary Naive Bayes classifier
Random forest decision tree based classifier
High performance java collections (previously colt collections)
A vibrant community
and many more cool stuff to come by this summer thanks to Google summer of code
I'm interested in machine learning techniques (graphical models, kernel methods) applied to text understanding (entity and relation extraction, coreference resolution, document classification and clustering, confidence prediction, social network analysis, data mining).
D. Schlör, J. Pfister, and A. Hotho. 2023 the 7th International Conference on Medical and Health Informatics (ICMHI), page 136–141. New York, NY, USA, Association for Computing Machinery, (2023)
L. Silva, and L. Jayaratne. Applications of Digital Information and Web Technologies, 2009. ICADIWT '09. Second International Conference on the, page 446-451. (August 2009)
S. Rudolph, J. Völker, and P. Hitzler. Proceedings of the 15th International Conference on Conceptual Structures (ICCS 2007), volume 4604 of Lecture Notes in Artificial Intelligence, page 488-491. Berlin, Heidelberg, Springer-Verlag, (July 2007)
X. Zhang, and Y. LeCun. (2015)cite arxiv:1502.01710Comment: This technical report is superseded by a paper entitled "Character-level Convolutional Networks for Text Classification", arXiv:1509.01626. It has considerably more experimental results and a rewritten introduction.