This project contains Naive and Fishers bayesian classifiers, as described in Toby Segaran's book "Programming Collective Intelligence." The book has python implementations; this is a Java implementation.
Libtextcat is a library with functions that implement the classification technique described in Cavnar & Trenkle, "N-Gram-Based Text Categorization" [1]. It was primarily developed for language guessing, a task on which it is known to perform with near-pe
A. Almal, A. Mitra, R. Datar, P. Lenehan, D. Fry, R. Cote, und W. Worzel. GECCO 2006: Proceedings of the 8th annual conference
on Genetic and evolutionary computation, 1, Seite 239--246. Seattle, Washington, USA, ACM Press, (8-12 July 2006)