Data preparation using data quality matrices for classification mining
Data mining aims to find patterns in organizational databases. However, most techniques in mining do not consider knowledge of the quality of the database. In this work, we show how to incorporate into classification mining recent advances in the data quality field that view a database as the product of an imprecise manufacturing process where the flaws/defects are captured in quality matrices. We develop a general purpose method of incorporating data quality matrices into the data mining classification task. Our work differs from existing data preparation techniques since while other approaches detect and fix errors to ensure consistency with the entire data set our work makes use of the apriori knowledge of how the data is produced/manufactured.
Year of publication: |
2009
|
---|---|
Authors: | Davidson, Ian ; Tayi, Giri |
Published in: |
European Journal of Operational Research. - Elsevier, ISSN 0377-2217. - Vol. 197.2009, 2, p. 764-772
|
Publisher: |
Elsevier |
Keywords: | Data manufacturing Data quality Data preparation Application of data mining |
Saved in:
Saved in favorites
Similar items by person
-
Data preparation using data quality matrices for classification mining
Davidson, Ian, (2009)
-
Data preparation using data quality matrices for classification mining
Davidson, Ian, (2009)
-
Building a Certification and Inspection Data Infrastructure to Promote Transparent Markets
Luciano, Joanne S., (2017)
- More ...