Beautiful visualizations of how language differs among document types. - GitHub - JasonKessler/scattertext: Beautiful visualizations of how language differs among document types.
markdown-mode is a major mode for editing Markdown-formatted
text files in GNU Emacs. markdown-mode is free software, licensed
under the GNU GPL.
Markdown is a text-to-HTML conversion tool for web writers. Markdown allows you to write using an easy-to-read, easy-to-write plain text format, then convert it to structurally valid XHTML (or HTML).
This is the project page for SecondString, an open-source Java-based package of approximate string-matching techniques. This code was developed by researchers at Carnegie Mellon University from the Center for Automated Learning and Discovery, the Department of Statistics, and the Center for Computer and Communications Security.
SecondString is intended primarily for researchers in information integration and other scientists. It does or will include a range of string-matching methods from a variety of communities, including statistics, artificial intelligence, information retrieval, and databases. It also includes tools for systematically evaluating performance on test data. It is not designed for use on very large data sets.
DadaDodo is a program that analyses texts for word probabilities, and then generates random sentences based on that. Sometimes these sentences are nonsense; but sometimes they cut right through to the heart of the matter, and reveal hidden meanings.
The nonsense which follows is a Markov Chain based upon patterns in some pieces of English text. Word-Unit Nonsense uses patterns about words that tend to follow one another. Character-Unit Nonsense uses letters.
P. Moreira, Y. Bizzoni, K. Nielbo, I. Lassen, and M. Thomsen. Proceedings of the The 5th Workshop on Narrative Understanding, page 25--35. Toronto, Canada, Association for Computational Linguistics, (July 2023)
Z. Yang, D. Yang, C. Dyer, X. He, A. Smola, and E. Hovy. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, page 1480--1489. San Diego, California, Association for Computational Linguistics, (June 2016)
A. Nenkova, and R. Passonneau. Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics: HLT-NAACL 2004, page 145--152. Boston, Massachusetts, USA, Association for Computational Linguistics, (2004)
F. Arnold, and R. Jäschke. Proceedings of the Workshop on Natural Language Processing for Digital Humanities at ICON 2021, page 55--63. NLP Association of India, (2021)
S. Jänicke, T. Efer, M. Büchler, and G. Scheuermann. Computer Vision, Imaging and Computer Graphics - Theory and Applications, page 153--171. Cham, Springer International Publishing, (2015)
S. Jänicke, T. Efer, M. Büchler, and G. Scheuermann. Computer Vision, Imaging and Computer Graphics - Theory and Applications, page 153--171. Cham, Springer International Publishing, (2015)