
dc.contributor: Abelló Gamazo, Alberto
dc.contributor: Romero Moral, Óscar
dc.contributor.author: González Alonso, Pedro Javier
dc.contributor.other: Universitat Politècnica de Catalunya. Departament d'Enginyeria de Serveis i Sistemes d'Informació
dc.date.accessioned: 2017-07-14T07:45:10Z
dc.date.available: 2017-07-14T07:45:10Z
dc.date.issued: 2016-10-20
dc.identifier.uri: http://hdl.handle.net/2117/106397
dc.description.abstract: Nowadays, business analytical users need agile processes spanning from the selection of relevant data from raw data sources to the generation of data structures prepared to serve as input for OLAP, Data Mining and/or other analytical tools. However, the wide range of analytical needs and the increasing need for adaptive business strategies discourage the use of existing 'all-in-one' suites (i.e., end-to-end solutions from a single vendor). Instead, an agile, suite-independent approach is advisable, as it boosts both the user's independence from a specific vendor and the analytical capabilities enabled by combining several suites and tools according to the user's needs. In this thesis we present and develop 'SETA', a suite-independent agile analytical framework, by proposing a novel approach that combines rich metadata definition with automation components. As proof of validity, we instantiate the developed framework in a real-world project for the WHO Chagas Programme. This thesis makes two main contributions. First, an approach to store and integrate a set of heterogeneous data sources into a flexible data store that sits at an intermediate point between the classical Data Warehouse (DW) approaches and the recent Data Lake strategies. We argue that classical DW systems are too rigid to accommodate agile analytical pipelines, whereas Data Lakes and Big Data technologies are not suitable for many of today's organizations. We therefore present a novel approach that combines the two. Second, a rich definitional system to represent 1) the data components at the Source, Global Schema and Domain levels, 2) the data mappings between these levels and 3) the final user analytical requirements. This definitional system provides a flexible view of the data schema at different levels and enables the automation of the target data schemas and of the ETL processes that feed them.
dc.language.iso: eng
dc.publisher: Universitat Politècnica de Catalunya
dc.subject: Àrees temàtiques de la UPC::Informàtica
dc.subject.lcsh: Data warehousing
dc.subject.lcsh: Ontology
dc.subject.other: Data Lake
dc.subject.other: metamodel
dc.subject.other: non SQL
dc.subject.other: document stores
dc.subject.other: semantic awareness
dc.subject.other: multidimensional modeling
dc.subject.other: OLAP
dc.subject.other: ETL
dc.subject.other: OWL
dc.title: SETA: A suite-independent analytical framework
dc.type: Master thesis
dc.subject.lemac: Gestor de dades
dc.subject.lemac: Ontologia
dc.identifier.slug: 118236
dc.rights.access: Open Access
dc.date.updated: 2016-10-24T04:00:28Z
dc.audience.educationlevel: Màster
dc.audience.mediator: Facultat d'Informàtica de Barcelona
dc.audience.degree: MÀSTER UNIVERSITARI EN INNOVACIÓ I RECERCA EN INFORMÀTICA (Pla 2012)

