Zusammenfassung
As data analysts at a large information-intensive business, we often
have been asked to analyze new (to us) data sets. This experience
was the original motivation for our interest in the topics of exploratory
data mining and data quality. Most data mining and analysis techniques
assume that the data have been joined into a single table and cleaned,
and that the analyst already knows what she or he is looking for.
Unfortunately, the data set is usually dirty, composed of many tables,
and has unknown properties. Before any results can be produced, the
data must be cleaned and exploredâoften a long and difficult task.
Nutzer