Mostra el registre d'ítem simple

dc.contributor.authorMartínez Palau, Xavier
dc.contributor.authorDomínguez Sal, David
dc.contributor.authorLarriba Pey, Josep
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned2015-01-12T15:12:35Z
dc.date.created2010-09-01
dc.date.issued2010-09-01
dc.identifier.citationMartínez, X.; Domínguez, D.; Larriba, J. Two-way replacement selection. "The proceedings of the VLDB Endowment", 01 Setembre 2010, vol. 3, núm. 1-2, p. 871-881.
dc.identifier.issn2150-8097
dc.identifier.urihttp://hdl.handle.net/2117/25227
dc.description.abstractThe performance of external sorting using merge sort is highly dependent on the length of the runs generated. One of the most commonly used run generation strategies is Replacement Selection (RS) because, on average, it generates runs that are twice the size of the memory available. However, the length of the runs generated by RS is downsized for data with certain characteristics, like inputs sorted inversely with respect to the desired output order. The goal of this paper is to propose and analyze two-way replacement selection (2WRS), which is a generalization of RS obtained by implementing two heaps instead of the single heap implemented by RS. The appropriate management of these two heaps allows generating runs larger than the memory available in a stable way, i.e. independent from the characteristics of the datasets. Depending on the changing characteristics of the input dataset, 2WRS assigns a new data record to one or the other heap, and grows or shrinks each heap, accommodating to the growing or decreasing tendency of the dataset. On average, 2WRS creates runs of at least the length generated by RS, and longer for datasets that combine increasing and decreasing data subsets. We tested both algorithms on large datasets with different characteristics and 2WRS achieves speedups at least similar to RS, and over 2.5 when RS fails to generate large runs.
dc.format.extent11 p.
dc.language.isoeng
dc.publisherAssociation for Computing Machinery (ACM)
dc.subjectÀrees temàtiques de la UPC::Informàtica::Programació
dc.subject.lcshSorting (Electronic computers)
dc.subject.otherSorting
dc.subject.otherOut of core sorting
dc.subject.otherHeap sort
dc.subject.otherExternal sorting
dc.subject.otherReplacement selection
dc.subject.otherMerge sort
dc.subject.otherRun formation
dc.titleTwo-way replacement selection
dc.typeArticle
dc.subject.lemacProgramació (Ordinadors)
dc.contributor.groupUniversitat Politècnica de Catalunya. DAMA-UPC - Data Management Group
dc.identifier.doi10.14778/1920841.1920952
dc.relation.publisherversionhttp://dl.acm.org/citation.cfm?doid=1920841.1920952
dc.rights.accessRestricted access - publisher's policy
local.identifier.drac15347279
dc.description.versionPostprint (published version)
dc.date.lift10000-01-01
local.citation.authorMartínez, X.; Domínguez, D.; Larriba, J.
local.citation.publicationNameThe proceedings of the VLDB Endowment
local.citation.volume3
local.citation.number1-2
local.citation.startingPage871
local.citation.endingPage881


Fitxers d'aquest items

Imatge en miniatura

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple