A data quality framework for graph-based virtual data integration systems
View/Open
Cita com:
hdl:2117/375124
Document typeConference report
Defense date2022
PublisherSpringer
Rights accessOpen Access
All rights reserved. This work is protected by the corresponding intellectual and industrial
property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public
communication or transformation of this work are prohibited without permission of the copyright holder
Abstract
Data Quality (DQ) plays a critical role in data integration. Up to now, DQ has mostly been addressed from a single database perspective. Popular DQ frameworks rely on Integrity Constraints (IC) to enforce valid application semantics, which lead to the Denial Constraint (DC) formalism which models a broad range of ICs in real-world applications. Yet, current approaches are rather monolithic, considering a single database and do not suit data integration scenarios. In this paper, we address DQ for data integration systems. Specifically, we extend virtual data integration systems to elicit DCs from disparate data sources to be integrated, using DC-related state-of-the-art, and propagate them to the integrated schema (global DCs). Then, we propose a method to manage global DCs and identify (i) minimal DCs and (ii) potential clashes between them.
CitationLi, Y.; Nadal, S.; Romero, O. A data quality framework for graph-based virtual data integration systems. A: European Conference on Advances in Databases and Information Systems. "Advances in Databases and Information Systems: 26th European Conference, ADBIS 2022: Turin, Italy, September 5-8, 2022: proceedings". Berlín: Springer, 2022, p. 104-117. ISBN 978-3-031-15740-0. DOI 10.1007/978-3-031-15740-0_9.
ISBN978-3-031-15740-0
Publisher versionhttps://link.springer.com/chapter/10.1007/978-3-031-15740-0_9
Files | Description | Size | Format | View |
---|---|---|---|---|
2022-ADBIS.pdf | 525,6Kb | View/Open |