tomotopy is a Python extension of tomoto (Topic Modeling Tool) which is a Gibbs-sampling based topic model library written in C++. It utilizes a vectorization of modern CPUs for maximizing speed. The current version of tomoto supports several major topic models including
In natural language understanding, there is a hierarchy of lenses through which we can extract meaning - from words to sentences to paragraphs to documents. At the document level, one of the most useful ways to understand text is by analyzing its topics.
R. Mehrotra, S. Sanner, W. Buntine, и L. Xie. Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval, стр. 889--892. ACM, (2013)
L. Dietz, S. Bickel, и T. Scheffer. Proceedings of the 24th international conference on Machine learning, стр. 233--240. New York, NY, USA, ACM, (2007)