ARDI: automatic generation of RDFS models from heterogeneous data sources
| dc.contributor.author | Nigatu, Shumet Tadesse |
| dc.contributor.author | Gómez Seoane, Cristina |
| dc.contributor.author | Romero Moral, Óscar |
| dc.contributor.author | Hose, Katja |
| dc.contributor.author | Rabbani, Kashif |
| dc.contributor.group | Universitat Politècnica de Catalunya. inSSIDE - integrated Software, Service, Information and Data Engineering |
| dc.contributor.group | Universitat Politècnica de Catalunya. IMP - Information Modeling and Processing |
| dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Enginyeria de Serveis i Sistemes d'Informació |
| dc.date.accessioned | 2020-03-12T12:53:16Z |
| dc.date.available | 2020-03-12T12:53:16Z |
| dc.date.issued | 2019 |
| dc.description.abstract | The current wealth of information, typically known as Big Data, generates a large amount of available data for organisations. Data Integration provides foundations to query disparate data sources as if they were integrated into a single source. However, current data integration tools are far from being useful for most organisations due to the heterogeneous nature of data sources, which represents a challenge for current frameworks. To enable data integration of highly heterogeneous and disparate data sources, this paper proposes a method to extract the schema from semi-structured (such as JSON and XML) and structured (such as relational) data sources, and generate an equivalent RDFS representation. The output of our method complements current frameworks and reduces the manual workload required to represent the input data sources in terms of the integration canonical data model. Our approach consists of production rules at the meta-model level that guarantee the correctness of the model translations. Finally, a tool for implementing our approach has been developed. |
| dc.description.peerreviewed | Peer Reviewed |
| dc.description.version | Postprint (author's final draft) |
| dc.format.extent | 7 p. |
| dc.identifier.citation | Nigatu, S. [et al.]. ARDI: automatic generation of RDFS models from heterogeneous data sources. A: The Enterprise Computing Conference. "2019 IEEE 23rd International Enterprise Distributed Object Computing Conference, EDOC 2019: Paris, France, 28-31 October 2019: proceedings". Institute of Electrical and Electronics Engineers (IEEE), 2019, p. 190-196. |
| dc.identifier.doi | 10.1109/EDOC.2019.00031 |
| dc.identifier.isbn | 978-1-7281-2702-6 |
| dc.identifier.uri | https://hdl.handle.net/2117/179803 |
| dc.language.iso | eng |
| dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) |
| dc.relation.projectid | info:eu-repo/grantAgreement/MINECO/1PE/TIN2016-79269-R |
| dc.relation.publisherversion | https://ieeexplore.ieee.org/document/8945018 |
| dc.rights.access | Open Access |
| dc.subject | Àrees temàtiques de la UPC::Informàtica::Sistemes d'informació |
| dc.subject.lcsh | Big data |
| dc.subject.lcsh | Data integration (Computer science) |
| dc.subject.lcsh | Metadata |
| dc.subject.lemac | Macrodades |
| dc.subject.lemac | Metadades |
| dc.subject.other | Data model translation |
| dc.subject.other | Data integration |
| dc.subject.other | RDF schema |
| dc.subject.other | Meta-modeling |
| dc.title | ARDI: automatic generation of RDFS models from heterogeneous data sources |
| dc.type | Conference report |
| dspace.entity.type | Publication |
| local.citation.author | Nigatu, S.; Gómez, C.; Romero, O.; Hose, K.; Rabbani, K. |
| local.citation.contributor | The Enterprise Computing Conference |
| local.citation.endingPage | 196 |
| local.citation.publicationName | 2019 IEEE 23rd International Enterprise Distributed Object Computing Conference, EDOC 2019: Paris, France, 28-31 October 2019: proceedings |
| local.citation.startingPage | 190 |
| local.identifier.drac | 26430115 |
Fitxers
Paquet original
1 - 1 de 1



