Register constrained modulo scheduling

Zalamea León, Francisco Javier; Llosa Espuny, José Francisco; Ayguadé Parra, Eduard; Valero Cortés, Mateo

doi:10.1109/TPDS.2004.1278099

dc.contributor.author	Zalamea León, Francisco Javier
dc.contributor.author	Llosa Espuny, José Francisco
dc.contributor.author	Ayguadé Parra, Eduard
dc.contributor.author	Valero Cortés, Mateo
dc.contributor.other	Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned	2015-11-06T14:45:09Z
dc.date.issued	2004-05
dc.identifier.citation	Zalamea, F., Llosa, J., Ayguade, E., Valero, M. Register constrained modulo scheduling. "IEEE transactions on parallel and distributed systems", May 2004, vol. 15, no. 5, p. 417-430.
dc.identifier.issn	1045-9219
dc.identifier.uri	http://hdl.handle.net/2117/78908
dc.description.abstract	Software pipelining is an instruction scheduling technique that exploits the instruction-level parallelism (ILP) available in loops by overlapping operations from successive loop iterations. The main drawback of aggressive software pipelining techniques is their high register requirements. If the requirements exceed the number of registers available in the target architecture, register pressure must be reduced, at some cost in performance, either by reducing iteration overlap or by spilling some lifetimes to memory. In the first part, we propose a set of heuristics to improve the spilling process and to better decide between adding spill code and directly decreasing the execution rate of iterations. The experimental evaluation, over a large number of representative loops and for a processor configuration, reports an increase in performance by a factor of 1.29 and a reduction of memory traffic by a factor of 1.36. In the second part, we analyze the use of backtracking and propose a novel approach for simultaneous instruction scheduling and register spilling in modulo scheduling: MIRS (modulo scheduling with integrated register spilling). The experimental evaluation reports an increase in performance by a factor of 1.46 and a reduction of memory traffic by a factor of 1.66 (or an additional 1.13 and 1.22 with regard to the proposal in the first part). These improvements are achieved at the expense of a reasonable increase in compilation time.
dc.format.extent	14 p.
dc.language.iso	eng
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject	UPC subject areas::Computer science::Computer architecture::Parallel architectures
dc.subject.lcsh	Parallel programming (Computer science)
dc.subject.lcsh	Computer architecture
dc.subject.other	Backtracking
dc.subject.other	Graph theory
dc.subject.other	Instruction sets
dc.subject.other	Pipeline processing
dc.subject.other	Processor scheduling
dc.subject.other	Program control structures
dc.subject.other	Resource allocation
dc.title	Register constrained modulo scheduling
dc.type	Article
dc.subject.lemac	Programació en paral·lel (Informàtica)
dc.subject.lemac	Arquitectura d'ordinadors
dc.contributor.group	Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi	10.1109/TPDS.2004.1278099
dc.description.peerreviewed	Peer Reviewed
dc.relation.publisherversion	http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1278099
dc.rights.access	Restricted access - publisher's policy
local.identifier.drac	654683
dc.description.version	Postprint (published version)
dc.date.lift	10000-01-01
local.citation.author	Zalamea, F.; Llosa, J.; Ayguade, E.; Valero, M.
local.citation.publicationName	IEEE transactions on parallel and distributed systems
local.citation.volume	15
local.citation.number	5
local.citation.startingPage	417
local.citation.endingPage	430
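
The trade-off described in the abstract (less iteration overlap means lower register pressure) can be illustrated with a minimal sketch. It is not the paper's heuristics or its integrated scheduler. The function name `max_live`, the lifetime tuples, and the example values are hypothetical, and the sketch assumes the per-iteration lifetimes stay fixed while the initiation interval (II) changes, which a real modulo scheduler would not guarantee.

```python
from typing import List, Tuple


def max_live(lifetimes: List[Tuple[int, int]], ii: int) -> int:
    """Peak number of simultaneously live values in the steady-state kernel.

    Each lifetime is (start, end) in cycles within one iteration, and the
    value is live on cycles start .. end-1. A new iteration starts every
    `ii` cycles, so a value live at cycle c occupies kernel row c % ii.
    """
    if ii <= 0:
        raise ValueError("initiation interval must be positive")
    live = [0] * ii
    for start, end in lifetimes:
        for cycle in range(start, end):
            live[cycle % ii] += 1
    return max(live)


# Hypothetical lifetimes for one loop iteration (assumed, not from the paper).
example_lifetimes = [(0, 6), (1, 5), (2, 8), (3, 4)]

for ii in (2, 4, 8):
    print(f"II={ii}: MaxLive={max_live(example_lifetimes, ii)}")
```

With these assumed lifetimes, a loop that needs more registers than the target provides at II=2 may fit at a larger II, at the cost of starting iterations less often. Spilling is the alternative the abstract mentions: it shortens lifetimes instead of raising II, but adds memory traffic.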

Files in this item

Name: Register Constrained Modulo ...
Size: 1.420 MB
Format: PDF
Description: Register Constrained Modulo ...

This item appears in the following collections

Journal articles [1,049]
Journal articles [382]