Mostra el registre d'ítem simple

dc.contributorAbelló Gamazo, Alberto
dc.contributor.authorMendt Peters, Tamara Desiree
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Enginyeria de Serveis i Sistemes d'Informació
dc.date.accessioned2015-10-19T12:36:56Z
dc.date.available2015-10-19T12:36:56Z
dc.date.issued2015-07-31
dc.identifier.urihttp://hdl.handle.net/2117/77883
dc.description.abstractShared nothing parallel data ow systems aim to bridge the gap between MapReduce and RDBMSs by combining parallel execution of second order functions with operator based optimizations. In parallel systems, job latency is strongly affected by data shuffling and unbalanced data across nodes, thus the degree of parallelism and the data partition- ing functions must be carefully considered when choosing optimization strategies. However, it is hard to make good optimization choices with- out any information about the distribution of the data. We attempt to overcome this challenge in shared nothing parallel data ows by tracking statistics of data sets during query runtime. We use data streaming algo- rithms to track statistics so as to affect job latency as little as possible. We discuss how collected statistics can potentially be used to improve execution plans during runtime.
dc.language.isoeng
dc.publisherUniversitat Politècnica de Catalunya
dc.subjectÀrees temàtiques de la UPC::Informàtica
dc.subject.lcshParallel computers
dc.titleCardinality Estimation in Shared-Nothing Parallel Dataflow Engines
dc.typeMaster thesis
dc.subject.lemacOrdinadors paral·lels
dc.identifier.slug108938
dc.rights.accessOpen Access
dc.date.updated2015-10-01T04:00:52Z
dc.audience.educationlevelMàster
dc.audience.mediatorFacultat d'Informàtica de Barcelona
dc.audience.degreeMÀSTER UNIVERSITARI ERASMUS MUNDUS EN TECNOLOGIES DE LA INFORMACIÓ PER A LA INTEL·LIGÈNCIA EMPRESARIAL (Pla 2012)


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple