The Natural Programming Project is working on making programming languages and environments easier to learn, more effective, and less error prone. We are taking a human-centered approach, first studying how people perform their tasks and then designing languages and environments around people's natural tendencies. We focus on all kinds of programming, including professional programmers, novice programmers who are trying to learn to be experts, and end users, who program to support other jobs or hobbies, such as multimedia authoring, simulations, teaching, prototyping, and other activities supported by computing.
Speech technology potentially allows everyone to participate in today's information revolution and can bridge the language barrier gap. Unfortunately, construction of speech processing systems requires significant resources. With some 6900 languages in the world, traditionally speech processing is prohibitive to all but the most economically viable languages. In spite of recent improvements in speech processing, supporting new languages is a skilled job requiring significant effort from trained individuals. SPICE aims to overcome both limitations by providing an interactive language creation and evaluation toolkit that allows everyone to develop speech processing models, to collect appropriate data for model building, and to evaluate the results enabling iterative improvements.
NGramJ is a Java based library containing two types of ngram based applications. It's major focus is to provide robust and state of the art language recognition.
Stanford CoreNLP provides a set of natural language analysis tools. It can give the base forms of words, their parts of speech, whether they are names of companies, people, etc., normalize dates, times, and numeric quantities, and mark up the structure of sentences in terms of phrases and word dependencies, indicate which noun phrases refer to the same entities, indicate sentiment, extract open-class relations between mentions, etc.
Libtextcat is a library with functions that implement the classification technique described in Cavnar & Trenkle, "N-Gram-Based Text Categorization" [1]. It was primarily developed for language guessing, a task on which it is known to perform with near-pe
M. Schwab, R. Jäschke, and F. Fischer. Proceedings of the 6th International Conference on Natural Language and Speech Processing, page 99--109. Association for Computational Linguistics, (2023)
P. Xia, S. Wu, and B. Van Durme. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), page 7516--7533. Association for Computational Linguistics, (November 2020)
L. Flek. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, page 7828--7838. Online, Association for Computational Linguistics, (July 2020)
S. Blodgett, S. Barocas, H. Daumé III, and H. Wallach. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, page 5454--5476. Online, Association for Computational Linguistics, (July 2020)
M. Peters, M. Neumann, R. Logan, R. Schwartz, V. Joshi, S. Singh, and N. Smith. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), page 43--54. Hong Kong, China, Association for Computational Linguistics, (November 2019)
S. Bordia, and S. Bowman. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, page 7--15. Minneapolis, Minnesota, Association for Computational Linguistics, (June 2019)
G. Muzny, M. Fang, A. Chang, and D. Jurafsky. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, page 460--470. Valencia, Spain, Association for Computational Linguistics, (April 2017)
C. Scheible, R. Klinger, and S. Padó. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), page 1736--1745. Berlin, Germany, Association for Computational Linguistics, (August 2016)
A. Hadgu, N. Lotze, and R. Jäschke. Proceedings of the Workshop on Natural Language Processing and Computational Social Science, Hannover, Germany, (May 2016)
M. Yahya, K. Berberich, S. Elbassuoni, M. Ramanath, V. Tresp, and G. Weikum. Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, page 379--390. Stroudsburg, PA, USA, Association for Computational Linguistics, (2012)
W. Barczynski, F. Foester, F. Brauer, and D. Schuster. to appear in: Proceedings of the 12th International Conference on Enterprise Information Systems (ICEIS-2010), Funchal, Portugal, (8 - 12 June 2010)
R. Snow, B. O'Connor, D. Jurafsky, and A. Ng. Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, page 254--263. Honolulu, Hawaii, Association for Computational Linguistics, (October 2008)