ARDI: automatic generation of RDFS models from heterogeneous data sources

dc.contributor.authorNigatu, Shumet Tadesse
dc.contributor.authorGómez Seoane, Cristina
dc.contributor.authorRomero Moral, Óscar
dc.contributor.authorHose, Katja
dc.contributor.authorRabbani, Kashif
dc.contributor.groupUniversitat Politècnica de Catalunya. inSSIDE - integrated Software, Service, Information and Data Engineering
dc.contributor.groupUniversitat Politècnica de Catalunya. IMP - Information Modeling and Processing
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Enginyeria de Serveis i Sistemes d'Informació
dc.date.accessioned2020-03-12T12:53:16Z
dc.date.available2020-03-12T12:53:16Z
dc.date.issued2019
dc.description.abstractThe current wealth of information, typically known as Big Data, generates a large amount of available data for organisations. Data Integration provides foundations to query disparate data sources as if they were integrated into a single source. However, current data integration tools are far from being useful for most organisations due to the heterogeneous nature of data sources, which represents a challenge for current frameworks. To enable data integration of highly heterogeneous and disparate data sources, this paper proposes a method to extract the schema from semi-structured (such as JSON and XML) and structured (such as relational) data sources, and generate an equivalent RDFS representation. The output of our method complements current frameworks and reduces the manual workload required to represent the input data sources in terms of the integration canonical data model. Our approach consists of production rules at the meta-model level that guarantee the correctness of the model translations. Finally, a tool for implementing our approach has been developed.
dc.description.peerreviewedPeer Reviewed
dc.description.versionPostprint (author's final draft)
dc.format.extent7 p.
dc.identifier.citationNigatu, S. [et al.]. ARDI: automatic generation of RDFS models from heterogeneous data sources. A: The Enterprise Computing Conference. "2019 IEEE 23rd International Enterprise Distributed Object Computing Conference, EDOC 2019: Paris, France, 28-31 October 2019: proceedings". Institute of Electrical and Electronics Engineers (IEEE), 2019, p. 190-196.
dc.identifier.doi10.1109/EDOC.2019.00031
dc.identifier.isbn978-1-7281-2702-6
dc.identifier.urihttps://hdl.handle.net/2117/179803
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.relation.projectidinfo:eu-repo/grantAgreement/MINECO/1PE/TIN2016-79269-R
dc.relation.publisherversionhttps://ieeexplore.ieee.org/document/8945018
dc.rights.accessOpen Access
dc.subjectÀrees temàtiques de la UPC::Informàtica::Sistemes d'informació
dc.subject.lcshBig data
dc.subject.lcshData integration (Computer science)
dc.subject.lcshMetadata
dc.subject.lemacMacrodades
dc.subject.lemacMetadades
dc.subject.otherData model translation
dc.subject.otherData integration
dc.subject.otherRDF schema
dc.subject.otherMeta-modeling
dc.titleARDI: automatic generation of RDFS models from heterogeneous data sources
dc.typeConference report
dspace.entity.typePublication
local.citation.authorNigatu, S.; Gómez, C.; Romero, O.; Hose, K.; Rabbani, K.
local.citation.contributorThe Enterprise Computing Conference
local.citation.endingPage196
local.citation.publicationName2019 IEEE 23rd International Enterprise Distributed Object Computing Conference, EDOC 2019: Paris, France, 28-31 October 2019: proceedings
local.citation.startingPage190
local.identifier.drac26430115

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
EDOC-CameraReady.pdf
Mida:
1.46 MB
Format:
Adobe Portable Document Format