Abstract

The rapid proliferation of computer-based information systems is increasing the importance of data quality to both system makers and users. However, there is neither an established framework nor common terminology for investigating data quality. There is not even agreement on what the term “data” means. We lay a foundation for the study of data quality in this paper. In the first part of the paper we discuss five approaches to defining “data” in the literature. We then propose an approach especially conducive to discussing data quality. In the second part of the paper we discuss the most important dimensions of data quality: accuracy, completeness, consistency and currentness. We define these four and several related dimensions and discuss them in detail. We close the paper by outlining several areas for further research on data quality.

Links and resources

Tags

community

  • @voj
  • @dblp
@voj's tags highlighted