MALLET is an integrated collection of Java code useful for statistical natural language processing, document classification, clustering, information extraction, and other machine learning applications to text.
P. Kluegl, M. Toepfer, F. Lemmerich, A. Hotho, and F. Puppe. Mathematical Methodologies in Pattern Recognition and Machine Learning, Springer Proceedings in Mathematics & Statistics, (2013)
C.-H. Chang, M. Kayed, M. R. Girgis, and K. Shaalan. IEEE Transactions on Knowledge and Data Engineering, 18 (10): 1411--1428, (2006)
J. Lafferty, A. McCallum, and F. Pereira. Proc. 18th International Conf. on Machine Learning, pages 282--289. Morgan Kaufmann, San Francisco, CA, (2001)
H. Han, C. Giles, E. Manavoglu, H. Zha, Z. Zhang, and E. Fox. JCDL '03: Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries, pages 37--48. Washington, DC, USA, IEEE Computer Society, (2003)
Y. Zhou, L. Wu, F. Weng, and H. Schmidt. Proceedings of the 2003 conference on Empirical methods in natural language processing, pages 153--159. Morristown, NJ, USA, Association for Computational Linguistics, (2003)