
dc.contributor.author: Romero Moral, Óscar
dc.contributor.author: Wrembel, Robert
dc.contributor.author: Song, Il-Yeol
dc.contributor.other: Universitat Politècnica de Catalunya. Departament d'Enginyeria de Serveis i Sistemes d'Informació
dc.date.accessioned: 2021-12-16T10:16:24Z
dc.date.available: 2021-12-27T01:30:15Z
dc.date.issued: 2020-09
dc.identifier.citation: Romero, O.; Wrembel, R.; Song, I. An alternative view on data processing pipelines from the DOLAP 2019 perspective. "Information systems", September 2020, vol. 92, p. 1-4.
dc.identifier.issn: 0306-4379
dc.identifier.uri: http://hdl.handle.net/2117/358649
dc.description.abstract: Data science requires constructing data processing pipelines (DPPs), which span diverse phases such as data integration, cleaning, pre-processing, and analysis. However, current solutions lack a strong data engineering perspective. As a consequence, DPPs are error-prone, inefficient w.r.t. human effort, and inefficient w.r.t. execution time. We claim that DPP design, development, testing, deployment, and execution should benefit from a standardized DPP architecture and from well-known data engineering solutions. This claim is supported by our experience in real projects and trends in the field, and it opens new paths for research and technology. In this spirit, we outline five research opportunities that represent novel trends towards building DPPs. Finally, we highlight that the best DOLAP 2019 papers selected for the DOLAP 2019 Information Systems Special Issue fall in this category and highlight the relevance of advanced data engineering for data science.
dc.format.extent: 4 p.
dc.language.iso: eng
dc.publisher: Elsevier
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights: © 2019 Elsevier
dc.rights.uri: https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject: UPC subject areas::Computer science::Information systems
dc.subject.lcsh: Data mining
dc.subject.lcsh: Databases
dc.subject.other: Data integration
dc.subject.other: ETL/ELT
dc.subject.other: ETL optimization
dc.subject.other: Data processing pipeline
dc.subject.other: Metadata
dc.subject.other: Data management
dc.subject.other: Data analytics
dc.title: An alternative view on data processing pipelines from the DOLAP 2019 perspective
dc.type: Article
dc.subject.lemac: Data mining
dc.subject.lemac: Databases
dc.contributor.group: Universitat Politècnica de Catalunya. inSSIDE - integrated Software, Service, Information and Data Engineering
dc.identifier.doi: 10.1016/j.is.2019.101489
dc.description.peerreviewed: Peer Reviewed
dc.relation.publisherversion: https://www.sciencedirect.com/science/article/pii/S0306437919305411
dc.rights.access: Open Access
local.identifier.drac: 30044332
dc.description.version: Postprint (author's final draft)
local.citation.author: Romero, O.; Wrembel, R.; Song, I.
local.citation.publicationName: Information systems
local.citation.volume: 92
local.citation.startingPage: 1
local.citation.endingPage: 4


