Discovering and Reconciling Value Conflicts for Data Integration
The integration of data from autonomous and heterogeneous sources calls for the prior identification and resolution of semantic conflicts that may be present. Unfortunately, this requires the system integrator to sift through the data from disparate systems in a painstaking manner. In this paper, we suggest that this process can be (at least) partially automated by presenting a methodology and techniques for the discovery of potential semantic conflicts as well as the underlying data transformation needed to resolve the conflicts. Our methodology begins by classifying data value conflicts into two categories: context independent and context dependent. While context independent conflicts are usually caused by unexpected errors, the context dependent conflicts are primarily a result of the heterogeneity of underlying data sources. To facilitate data integration, data value conversion rules are proposed to describe the quantitative relationships among data values involving context dependent conflicts. A general approach is proposed to discover data value conversion rules from the data. The approach consists of five major steps: relevant attribute analysis, candidate model selection, conversion function generation, conversion function selection and conversion rule formation. It is being implemented in a prototype system, DIRECT, for business data using statistics based techniques. Preliminary study indicated that the proposed approach is promising
Year of publication: |
2002
|
---|---|
Authors: | Lu, Hongjun ; Fan, Weiguo ; Goh, Cheng Hian ; Madnick, Stuart ; Cheung, David W. |
Publisher: |
[S.l.] : SSRN |
Saved in:
freely available
Extent: | 1 Online-Ressource (24 p) |
---|---|
Type of publication: | Book / Working Paper |
Language: | English |
Notes: | Nach Informationen von SSRN wurde die ursprüngliche Fassung des Dokuments November 2001 erstellt |
Other identifiers: | 10.2139/ssrn.335580 [DOI] |
Source: | ECONIS - Online Catalogue of the ZBW |
Persistent link: https://www.econbiz.de/10014107369
Saved in favorites
Similar items by person
-
Direct : A System for Mining Data Value Conversion Rules from Disparate Data Sources
Fan, Weiguo, (2003)
-
DIRECT: A System for Mining Data Value Conversion Rules from Disparate Data Sources
Fan, Weiguo, (2003)
-
Context interchange : new features and formalisms for the intelligent integration of information
Goh, Cheng Hian, (2008)
- More ...