The knowledge graph lifecycle in NTT DATA
Document typeConference lecture
Rights accessOpen Access
ProjectDESARROLLO, OPERATIVA Y GOBERNANZA DE DATOS PARA SISTEMAS SOFTWARE BASADOS EN APRENDIZAJE AUTOMATICO (AEI-PID2020-117191RB-I00)
The Semantic Business Unit (SEMBU) in NTT DATA aims to increase the semantic interoper ability and accessibility of European institutions’ data projects by following Linked Open Data (LOD) principles to build controlled vocabularies and produce Knowledge Graphs (KGs). One of its most notable projects revolves around the CORDIS portal1, which publishes information about research and innovation projects funded by the European Commission. SEMBU pursues two main goals: (i) expose semantic data related to CORDIS via a SPARQL endpoint that facilitates access and reuse of quality scientific-related data, and (ii) design an efficient, incremental, and automated KG lifecycle to be used as a reference in other data projects. To that end, we have adopted state-of-the-art semantic technologies to support the creation and management of the KG with the goal of centralizing knowledge and providing an overall view of data assets that improve data governance, maintenance, and external interaction by data consumers. We have also identified some of their limitations which are tackled via an industrial PhD. This paper reports our experience, the obstacles, and proposals for generating and maintaining the CORDIS KG.
CitationFlores, J. [et al.]. The knowledge graph lifecycle in NTT DATA. A: International Semantic Web Conference. "Proceedings of the ISWC 2022 Posters, Demos and Industry Tracks: From Novel Ideas to Industrial Practice: co-located with 21st International Semantic Web Conference (ISWC 2022): virtual conference, Hangzhou, China, October 23-27, 2022". CEUR-WS.org, 2022, ISSN 1613-0073.