Show simple item record

dc.contributor.author: Berna Juan, Alejandro
dc.contributor.author: Jiménez Castells, Marta
dc.contributor.author: Llaberia Griñó, José M.
dc.contributor.other: Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.identifier.citation: Berna, A.; Jimenez, M.; Llaberia, J. Source-to-Source transformations for efficient SIMD code generation. In: Jornadas de Paralelismo. "Actas de las XXII Jornadas de Paralelismo". La Laguna, Tenerife: 2011, p. 719-726.
dc.description.abstract: In recent years, there has been much effort in commercial compilers to generate efficient SIMD instruction-based code sequences from conventional sequential programs. However, the small number of compilers that can automatically use these instructions achieve unsatisfactory results in most cases. Therefore, the code often has to be written manually in assembly language or using compiler built-in functions to achieve high performance. In this work, we present source-to-source transformations that help commercial vectorizing compilers to generate efficient SIMD code. Experimental results show that excellent performance can be achieved. In particular, for the problem of matrix product (SGEMM) we achieve performance almost as high as that of hand-optimized numerical libraries. Our source-to-source transformations are based on the scalar replacement and unroll and jam transformations presented by Callahan et al. Specifically, we extend the use of scalar replacement to vectorial replacement and combine this transformation with unroll and jam and outer loop vectorization to fully exploit the vector register level and thus help the compiler generate efficient SIMD code. We show experimentally the effectiveness of our proposal.
dc.format.extent: 8 p.
dc.subject: Àrees temàtiques de la UPC::Informàtica::Sistemes d'informació
dc.subject.lcsh: Compilers (Computer programs)
dc.subject.lcsh: Computer algorithms
dc.title: Source-to-Source transformations for efficient SIMD code generation
dc.type: Conference report
dc.subject.lemac: Algorismes -- Programació (Ordinadors)
dc.subject.lemac: Algorismes -- Informàtica
dc.subject.lemac: Compiladors (Programes d'ordinador)
dc.contributor.group: Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.description.peerreviewed: Peer Reviewed
dc.rights.access: Open Access
dc.description.version: Postprint (published version)
upcommons.citation.author: Berna, A.; Jimenez, M.; Llaberia, J.
upcommons.citation.contributor: Jornadas de Paralelismo
upcommons.citation.pubplace: La Laguna, Tenerife
upcommons.citation.publicationName: Actas de las XXII Jornadas de Paralelismo
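
The abstract names scalar replacement and unroll and jam as the base transformations. As a rough illustration only (this is not the authors' code, and it does not show their vectorial replacement or outer loop vectorization), the sketch below applies both to a plain single-precision matrix product. The function names `matmul_ref` and `matmul_uj`, the row-major flat layout, the unroll factor of 2, and the requirement that `n` be even are all assumptions made for this example.

```c
#include <stddef.h>

/* Baseline: C += A * B for n x n row-major matrices (hypothetical helper). */
void matmul_ref(size_t n, const float *A, const float *B, float *C) {
    for (size_t i = 0; i < n; i++)
        for (size_t j = 0; j < n; j++)
            for (size_t k = 0; k < n; k++)
                C[i * n + j] += A[i * n + k] * B[k * n + j];
}

/*
 * Same computation after two source-to-source steps:
 *  - unroll and jam: the outer i loop is unrolled by 2 and the two copies
 *    of the inner loops are fused, so each B[k][j] load is used twice;
 *  - scalar replacement: C[i][j] and C[i+1][j] live in the scalars c0 and
 *    c1 during the k loop instead of being re-read and re-written in memory.
 * Assumption for this sketch: n is even (no remainder loop is shown).
 */
void matmul_uj(size_t n, const float *A, const float *B, float *C) {
    for (size_t i = 0; i < n; i += 2) {
        for (size_t j = 0; j < n; j++) {
            float c0 = C[i * n + j];        /* scalar-replaced accumulators */
            float c1 = C[(i + 1) * n + j];
            for (size_t k = 0; k < n; k++) {
                float b = B[k * n + j];     /* loaded once, reused for both rows */
                c0 += A[i * n + k] * b;
                c1 += A[(i + 1) * n + k] * b;
            }
            C[i * n + j] = c0;              /* single store per element */
            C[(i + 1) * n + j] = c1;
        }
    }
}
```

Both functions accumulate each element in the same order over `k`, so they compute the same sums. The transformed version only changes how often memory is touched, which is the property a vectorizing compiler can then exploit at the register level.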

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder.