Mostra el registre d'ítem simple
An alternative view on data processing pipelines from the DOLAP 2019 perspective
dc.contributor.author | Romero Moral, Óscar |
dc.contributor.author | Wrembel, Robert |
dc.contributor.author | Song, Il-Yeol |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Enginyeria de Serveis i Sistemes d'Informació |
dc.date.accessioned | 2021-12-16T10:16:24Z |
dc.date.available | 2021-12-27T01:30:15Z |
dc.date.issued | 2020-09 |
dc.identifier.citation | Romero, O.; Wrembel, R.; Song, I. An alternative view on data processing pipelines from the DOLAP 2019 perspective. "Information systems", Setembre 2020, vol. 92, p. 1-4. |
dc.identifier.issn | 0306-4379 |
dc.identifier.uri | http://hdl.handle.net/2117/358649 |
dc.description.abstract | Data science requires constructing data processing pipelines (DPPs), which span diverse phases such as data integration, cleaning, pre-processing, and analysis. However, current solutions lack a strong data engineering perspective. As consequence, DPPs are error-prone, inefficient w.r.t. human efforts, and inefficient w.r.t. execution time. We claim that DPP design, development, testing, deployment, and execution should benefit from a standardized DPP architecture and from well-known data engineering solutions. This claim is supported by our experience in real projects and trends in the field, and it opens new paths for research and technology. With this spirit, we outline five research opportunities that represent novel trends towards building DPPs. Finally, we highlight that the best DOLAP 2019 papers selected for the DOLAP 2019 Information Systems Special Issue fall in this category and highlight the relevance of advanced data engineering for data science. |
dc.format.extent | 4 p. |
dc.language.iso | eng |
dc.publisher | Elsevier |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International |
dc.rights | © 2019 Elsevier |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Sistemes d'informació |
dc.subject.lcsh | Data mining |
dc.subject.lcsh | Databases |
dc.subject.other | Data integration |
dc.subject.other | ETL/ELT |
dc.subject.other | ETL optimization |
dc.subject.other | Data processing pipeline |
dc.subject.other | Metadata |
dc.subject.other | Data management |
dc.subject.other | Data analytics |
dc.title | An alternative view on data processing pipelines from the DOLAP 2019 perspective |
dc.type | Article |
dc.subject.lemac | Mineria de dades |
dc.subject.lemac | Bases de dades |
dc.contributor.group | Universitat Politècnica de Catalunya. inSSIDE - integrated Software, Service, Information and Data Engineering |
dc.identifier.doi | 10.1016/j.is.2019.101489 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | https://www.sciencedirect.com/science/article/pii/S0306437919305411 |
dc.rights.access | Open Access |
local.identifier.drac | 30044332 |
dc.description.version | Postprint (author's final draft) |
local.citation.author | Romero, O.; Wrembel, R.; Song, I. |
local.citation.publicationName | Information systems |
local.citation.volume | 92 |
local.citation.startingPage | 1 |
local.citation.endingPage | 4 |
Fitxers d'aquest items
Aquest ítem apareix a les col·leccions següents
-
Articles de revista [113]
-
Articles de revista [222]