Gransk is a free and open source tool that aims to be a Swiss army knife of document processing and analysis. Its primary objective is to quikly provide users with insight to their documents during investigations. It includes a processing engine written in Python and a web interface. Under the hood it uses Apache Tika for content extraction, Elasticsearch for data indexing, and dfVFS to unpack disk images.
Paper Machines is a plugin for the Zotero bibliographic management software that makes cutting-edge topic-modeling analysis in Computer Science accessible to humanities researchers without requiring extensive computational resources or technical knowledge. It synthesizes several approaches to visualization within a highly accessible user interface.
R. Dogan, и Z. Lu. Proceedings of the 2012 Workshop on Biomedical Natural Language Processing, стр. 91--99. Stroudsburg, PA, USA, Association for Computational Linguistics, (2012)