Zusammenfassung
The extraction of semantics of unstructured documents requires the recognition and classification of textual patterns, their variability, and their inter-relationships, i.e., the analysis of the linguistic structure of documents. Being the integral part of a larger real-life application, this linguistic analysis process must be robust, fast and adaptable. This creates a big challenge for the development of the necessary linguistic base components. In this drill-down, we present several dimensions of this challenge and show how they have been successfully tackled in Ordo.
Nutzer