This Thinking XML column shows how to combine metadata collected from multiple XML source documents into a single Resource Description Framework (RDF) model for effective querying. In this follow-up to his previous installment that introduced how to use X
It might seem redundant to have semantic tagging when you can basically find anything you can think of with simple searches in Google or Yahoo. But del.icio.us seems to be most surprising when you're trying to find things that relate to what you're intere
Much as people and economies depend on information, the exchange of data has often been hindered by the incompatible formats of proprietary hardware and software. That was less of a problem when computers rarely communicated with each other, but now it's
Repeat after me: "There is no syntax." In order to use and gain advantage from RDF, you do not have to use any particular syntax -- not even the syntax specified in the RDF 1.0 specification. Uche Ogbuji discusses the importance of XML/RDF interchange, of
It is important to differentiate between text data mining and information access (or information retrieval, as it is more widely known)... the goal of data mining is to discover or derive new information from data, finding patterns across datasets, and/o
How do you share a generally-understood reference, not to a web service, but to a calendar event on the service? How do you index it? Google indexes pages, not services. How, indeed do you bookmark it? Hidden data means no URLs...The Internet is driven by
...microformats are (in Alex's words) "adding semantics to markup to take it from being machine readable to being machine understandable." So what use would microformats be in a web browser? Alex explains that microformats will make the Web Browser
Data mining (DM), also called Knowledge-Discovery in Databases (KDD) or Knowledge-Discovery and Data Mining, is the process of automatically searching large volumes of data for patterns using tools such as classification, association rule mining, clusteri
This column, the third in a series, shows how to add semantic knowledge to an RDF application by incorporating WordNet synonym sets. With the added knowledge of the WordNet lexical database, you can search a set of RDF data for related concepts, not just
Lexical ambiguity is a fundamental problem in Information Retrieval (IR), especially in the medical domain. Many systems use a subset of the words contained in the document to represent the content, but they are faced with the problem of ambiguity.
DBin is general purpose Semantic Web application that enables power users (domain experts) to create "discussion groups" where users annotate any subject of interest (from "beers" as in our example to anything really). At low level, these annotatins are e
Ex: When viewing http://www.bibsonomy.org/user/schmitz/linux, you will be offered the possibility of seeing the tag linux as a concept. That way, you will also see those resources not tagged with linux themselves, but with a direct subtag thereof, for exa
Uche Ogbuji moves on to a discussion of a far more sophisticated RDF query language than the primitive API he has discussed thus far. This is the foundation for establishing the middleware for the Issue Tracker article in coming installments. So far, in
Recent Developments on the Internet: The common characteristic, which all these systems share, is that the approach is "bottom-up" rather than "top-down". This means that in these environments content and structure are not determined by professional, corp