Mostra el registre d'ítem simple

dc.contributor.authorZalamea León, Francisco Javier
dc.contributor.authorLlosa Espuny, José Francisco
dc.contributor.authorAyguadé Parra, Eduard
dc.contributor.authorValero Cortés, Mateo
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned2015-11-06T14:45:09Z
dc.date.issued2004-05
dc.identifier.citationZalamea, F., Llosa, J., Ayguade, E., Valero, M. Register constrained modulo scheduling. "IEEE transactions on parallel and distributed systems", Maig 2004, vol. 15, núm. 5, p. 417-430.
dc.identifier.issn1045-9219
dc.identifier.urihttp://hdl.handle.net/2117/78908
dc.description.abstractSoftware pipelining is an instruction scheduling technique that exploits the instruction level parallelism (ILP) available in loops by overlapping operations from various successive loop iterations. The main drawback of aggressive software pipelining techniques is their high register requirements. If the requirements exceed the number of registers available in the target architecture, some steps need to be applied to reduce the register pressure (incurring some performance degradation): reduce iteration overlapping or spilling some lifetimes to memory. In the first part, we propose a set of heuristics to improve the spilling process and to better decide between adding spill code or directly decreasing the execution rate of iterations. The experimental evaluation, over a large number of representative loops and for a processor configuration, reports an increase in performance by a factor of 1.29 and a reduction of memory traffic by a factor of 1.36. In the second part, we analyze the use of backtracking and propose a novel approach for simultaneous instruction scheduling and register spilling in modulo scheduling: MIPS (modulo scheduling with integrated register spilling). The experimental evaluation reports an increase in performance by a factor of 1.46 and a reduction of the memory traffic by a factor of 1.66 (or an additional 1.13 and 1.22 with regard to the proposal in the first part). These improvements are achieved at the expense of a reasonable increase in the compilation time.
dc.format.extent14 p.
dc.language.isoeng
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subjectÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors::Arquitectures paral·leles
dc.subject.lcshParallel programming (Computer science)
dc.subject.lcshComputer architecture
dc.subject.otherBacktracking
dc.subject.otherGraph theory
dc.subject.otherInstruction sets
dc.subject.otherPipeline processing
dc.subject.otherProcessor scheduling
dc.subject.otherProgram control structures
dc.subject.otherResource allocation
dc.titleRegister constrained modulo scheduling
dc.typeArticle
dc.subject.lemacProgramació en paral·lel (Informàtica)
dc.subject.lemacArquitectura d'ordinadors
dc.contributor.groupUniversitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi10.1109/TPDS.2004.1278099
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1278099
dc.rights.accessRestricted access - publisher's policy
local.identifier.drac654683
dc.description.versionPostprint (published version)
dc.date.lift10000-01-01
local.citation.authorZalamea, F.; Llosa, J.; Ayguade, E.; Valero, M.
local.citation.publicationNameIEEE transactions on parallel and distributed systems
local.citation.volume15
local.citation.number5
local.citation.startingPage417
local.citation.endingPage430


Fitxers d'aquest items

Imatge en miniatura

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple