Mostra el registre d'ítem simple

dc.contributor.authorHayes, Timothy
dc.contributor.authorPalomar Pérez, Óscar
dc.contributor.authorUnsal, Osman Sabri
dc.contributor.authorCristal Kestelman, Adrián
dc.contributor.authorValero Cortés, Mateo
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.contributor.otherBarcelona Supercomputing Center
dc.date.accessioned2015-09-30T13:28:27Z
dc.date.issued2015
dc.identifier.citationHayes, T., Palomar, O., Unsal, O., Cristal, A., Valero, M. VSR sort: a novel vectorised sorting algorithm and architecture extensions for future microprocessors. A: International Symposium on High-Performance Computer Architecture. "2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA 2015): Burlingame, California, USA: 7-11 February 2015". San Francisco Bay Area, California: Institute of Electrical and Electronics Engineers (IEEE), 2015, p. 26-38.
dc.identifier.isbn978-1-4799-8931-7
dc.identifier.urihttp://hdl.handle.net/2117/77204
dc.description.abstractSorting is a widely studied problem in computer science and an elementary building block in many of its subfields. There are several known techniques to vectorise and accelerate a handful of sorting algorithms by using single instruction-multiple data (SIMD) instructions. It is expected that the widths and capabilities of SIMD support will improve dramatically in future microprocessor generations and it is not yet clear whether or not these sorting algorithms will be suitable or optimal when executed on them. This work extrapolates the level of SIMD support in future microprocessors and evaluates these algorithms using a simulation framework. The scalability, strengths and weaknesses of each algorithm are experimentally derived. We then propose VSR sort, our own novel vectorised non-comparative sorting algorithm based on radix sort. To facilitate the execution of this algorithm we define two new SIMD instructions and propose a complementary hardware structure for their execution. Our results show that VSR sort has maximum speedups between 14.9x and 20.6x over a scalar baseline and an average speedup of 3.4x over the next-best vectorised sorting algorithm.
dc.description.sponsorshipThe authors would like to thank Morteza Biglari-Abhari for reading this work. We are largely indebted to the efforts of Michael Swift who meticulously reviewed and critiqued the text multiple times leading to a considerably improved publication. We would also like to thank all of our anonymous reviewers, especially reviewer B for his or her highly detailed and insightful comments which helped develop the article. The research leading to these results has received funding from the European Union’s Seventh Framework Programme (FP7/2007-2013) under the AXLE project (GA no. 318633) and from the RoMoL ERC Advanced Grant (GA no. 321253). Timothy Hayes is also supported by a FPU research grant from the Spanish MECD.
dc.format.extent13 p.
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació
dc.subjectÀrees temàtiques de la UPC::Informàtica
dc.subject.lcshMicroprocessors
dc.subject.lcshSupercomputers
dc.subject.otherDigital arithmetic
dc.subject.otherMicroprocessor chips
dc.subject.otherParallel architectures
dc.subject.otherSorting
dc.titleVSR sort: a novel vectorised sorting algorithm and architecture extensions for future microprocessors
dc.typeConference report
dc.subject.lemacMicroprocessadors
dc.subject.lemacSupercomputadors
dc.contributor.groupUniversitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi10.1109/HPCA.2015.7056019
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7056019
dc.rights.accessOpen Access
local.identifier.drac15430235
dc.description.versionPostprint (author's final draft)
local.citation.authorHayes, T.; Palomar, O.; Unsal, O.; Cristal, A.; Valero, M.
local.citation.contributorInternational Symposium on High-Performance Computer Architecture
local.citation.pubplaceSan Francisco Bay Area, California
local.citation.publicationName2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA 2015): Burlingame, California, USA: 7-11 February 2015
local.citation.startingPage26
local.citation.endingPage38


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple