This web page provides information and errata for the book Learning with Kernels by Bernhard Schölkopf and Alex Smola (MIT Press, Cambridge, MA, 2002), along with about a third of its chapters.
LIBSVM is integrated software for support vector classification (C-SVC, nu-SVC), regression (epsilon-SVR, nu-SVR), and distribution estimation (one-class SVM). It supports multi-class classification.
All programs and resources on the list are free, i.e. available at no cost (for research purposes), applicable to German-language texts, and ready to use immediately, i.e. they do not first have to be trained with, for example, annotated corpora. The list is of course incomplete (as of 22 May 2007).
MegaMap is a Java implementation of a map (or hashtable) that can store an unbounded amount of data, limited only by the amount of disk space available. Objects stored in the map are persisted to disk. Good performance is achieved by an in-memory cache. The MegaMap can, for all practical purposes, be thought of as a map implementation with unlimited storage space.
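The idea described above (a map persisted to disk, fronted by an in-memory cache) can be illustrated with a minimal Python sketch. This is not MegaMap's API or implementation: the class `DiskBackedMap`, its method names, and the `cache_size` parameter are hypothetical, and the sketch assumes string keys because it relies on Python's standard `shelve` module for persistence.

```python
import os
import shelve
import tempfile
from collections import OrderedDict


class DiskBackedMap:
    """Hypothetical disk-backed map with a small in-memory LRU cache."""

    def __init__(self, path, cache_size=2):
        self._store = shelve.open(path)  # persistent key/value store on disk
        self._cache = OrderedDict()      # recently used entries, oldest first
        self._cache_size = cache_size

    def _remember(self, key, value):
        # Insert or refresh the entry, then evict the least recently used one.
        self._cache[key] = value
        self._cache.move_to_end(key)
        if len(self._cache) > self._cache_size:
            self._cache.popitem(last=False)

    def put(self, key, value):
        self._store[key] = value  # write through to disk
        self._remember(key, value)

    def get(self, key):
        if key in self._cache:        # cache hit: no disk access
            self._cache.move_to_end(key)
            return self._cache[key]
        value = self._store[key]      # cache miss: read from disk
        self._remember(key, value)
        return value

    def cached_keys(self):
        return list(self._cache)

    def close(self):
        self._store.close()


def demo():
    """Store three entries with a cache of two, then read the evicted one."""
    with tempfile.TemporaryDirectory() as tmp:
        m = DiskBackedMap(os.path.join(tmp, "store"), cache_size=2)
        for key, value in [("a", 1), ("b", 2), ("c", 3)]:
            m.put(key, value)
        before = m.cached_keys()  # "a" has been evicted from memory
        value = m.get("a")        # still retrievable from disk
        after = m.cached_keys()
        m.close()
        return before, value, after
```

The cache bounds memory use while the on-disk store bounds capacity, which is the trade-off the description refers to. A write-through policy is assumed here for simplicity.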
MSTParser is a non-projective dependency parser that searches for maximum spanning trees over directed graphs. Models of dependency structure are based on large-margin discriminative training methods. Projective parsing is also supported.
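To make the projective/non-projective distinction concrete, here is a small sketch that checks whether a dependency tree has crossing arcs. It is an illustration of the concept only, not code from MSTParser. The function name `is_projective` and the head-list encoding (token positions start at 1, head 0 is an artificial root) are assumptions made for this example.

```python
def is_projective(heads):
    """Return True if no two dependency arcs cross.

    heads[i] is the head position of token i + 1; 0 denotes the
    artificial root placed before the first token.
    """
    # Represent each arc by its left and right end points.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            # Two arcs cross if exactly one end of the second arc
            # lies strictly inside the first.
            if l1 < l2 < r1 < r2:
                return False
    return True


# Token 2 is the root; tokens 1 and 3 attach to it: no crossing arcs.
projective_tree = [2, 0, 2]

# Token 3 is the root; arc 3 -> 1 crosses arc 4 -> 2.
non_projective_tree = [3, 4, 0, 3]
```

A projective parser can only produce trees for which this check succeeds, whereas a maximum-spanning-tree search over the full directed graph can also return trees with crossing arcs.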
MuNPEx is a multi-lingual noun phrase (NP) extraction component developed for the GATE architecture, implemented in JAPE. It currently supports English, German, French, and Spanish (in beta).
MuNPEx requires a part-of-speech (POS) tagger to work and can additionally use detected named entities (NEs) to improve chunking performance. Please read the documentation (or source code) for more details.
The NEGRA corpus version 2 consists of 355,096 tokens (20,602 sentences) of German newspaper text from the Frankfurter Rundschau. The texts are taken from the CD "Multilingual Corpus 1" of the European Corpus Initiative. The corpus is based on approximately 60,000 tokens that were annotated with parts of speech at the Institut für maschinelle Sprachverarbeitung, Stuttgart. This corpus was extended, likewise tagged with parts of speech, and fully annotated with syntactic structures. The corpus was built in the projects NEGRA (DFG Sonderforschungsbereich 378, Project C3) and LINC (Universität des Saarlandes) in Saarbrücken.
NestedVM provides binary translation for Java Bytecode. This is done by having GCC compile to a MIPS binary which is then translated to a Java class file. Hence any application written in C, C++, Fortran, or any other language supported by GCC can be run in 100% pure Java with no source changes.
Node.js provides its own assert module with some really useful functions for creating basic tests. However, the reporting and running of these assertions can become complicated, especially with asynchronous code. How can you be sure that all assertions ran? Or that they ran in the correct order? This is where nodeunit comes in, a tool for defining and running unit tests in the simplest way possible.
M. Zhang, J. Zhang, J. Su, and G. Zhou. ACL-44: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, page 825--832. Morristown, NJ, USA, Association for Computational Linguistics, (2006)
R. Kate. Proceedings of the conference on Empirical Methods in Natural Language Processing (EMNLP-2008), page 400--409. Waikiki, Honolulu, Hawaii, (2008)
D. Roth, M. Sammons, and V. Vydiswaran. Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, page 57--60. Suntec, Singapore, Association for Computational Linguistics, (August 2009)
D. Roth, and W. Yih. HLT-NAACL 2004 Workshop: Eighth Conference on Computational Natural Language Learning (CoNLL-2004), page 1--8. Boston, Massachusetts, USA, Association for Computational Linguistics, (May 2004)
M. Wang. Proceedings of the Third International Joint Conference on Natural Language Processing, 2, page 841--846. Hyderabad, India, Asian Federation of Natural Language Processing, Association for Computational Linguistics, (January 2008)
R. Bunescu, and R. Mooney. Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT '05), October 6-8, 2005, Vancouver, British Columbia, Canada, page 724--731. Association for Computational Linguistics, Morristown, NJ, USA, (2005)
J. Jiang, and C. Zhai. Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics, page 113--120. Rochester, New York, Association for Computational Linguistics, (April 2007)
F. Zanzotto, and A. Moschitti. ACL-44: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, page 401--408. Morristown, NJ, USA, Association for Computational Linguistics, (2006)
P. Pantel, and M. Pennacchiotti. Ontology Learning and Population: Bridging the Gap between Text and Knowledge, volume 167 of Frontiers in Artificial Intelligence and Applications, IOS Press, (2008)
J. Kleinberg. KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, page 91--101. New York, NY, USA, ACM, (2002)
X. Wan, and J. Yang. SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, page 143--150. New York, NY, USA, ACM, (2007)
F. Suchanek, G. Ifrim, and G. Weikum. 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2006), page 712--717. New York, NY, USA, ACM, (2006)
T. Zesch, I. Gurevych, and M. Mühlhäuser. Proceedings of Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007), page 205--208. (2007)
F. Reichartz, H. Korte, and G. Paass. Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, page 365--368. Suntec, Singapore, Association for Computational Linguistics, (August 2009)