Natural Language systems have evolved tremendously in the past few years from dealing only with small handcrafted examples to extremely large, real-world applications.
Die Tübinger Baumbank des Deutschen / Schriftsprache (TüBa-D/Z) ist ein syntaktisch annotiertes Korpus auf der Grundlage der Zeitung "die tageszeitung" (taz). Sie umfasst zur Zeit ca. 36 000 Sätze bzw. 630 000 Worte.
AI Related Ruby Extensions This page will maintain list of AI related extensions/modules/gems for the Ruby programming language. Please contact me if you know something I missed.
great open source software with various functionality for text and NLP support. Has components for rule and dictionary based extraction, co-reference analysis.
a suite of open source Python modules, data and documentation for research and development in natural language processing. NLTK contains Code supporting dozens of NLP tasks, along with 40 popular Corpora and extensive Documentation including a 375-page online Book. Distributions for Windows, Mac OSX and Linux are available.
OpenCyc is the open source version of the Cyc technology, the world's largest and most complete general knowledge base and commonsense reasoning engine.