dc.contributor.author | Codina Viñas, Josep M. |
dc.contributor.author | Sánchez Navarro, F. Jesús |
dc.contributor.author | González Colás, Antonio María |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors |
dc.date.accessioned | 2017-02-22T08:12:15Z |
dc.date.available | 2017-02-22T08:12:15Z |
dc.date.issued | 2001 |
dc.identifier.citation | Codina, J.M., Sánchez, F., González, A. A unified modulo scheduling and register allocation technique for clustered processors. A: International Conference on Parallel Architectures and Compilation Techniques. "2001 International Conference on Parallel Architectures and Compilation Techniques: 8-12 September 2001 Barcelona, Catalunya, Spain: proceedings". Barcelona: Institute of Electrical and Electronics Engineers (IEEE), 2001, p. 175-184. |
dc.identifier.isbn | 0-7695-1363-8 |
dc.identifier.uri | http://hdl.handle.net/2117/101361 |
dc.description.abstract | This work presents a modulo scheduling framework for clustered ILP processors that integrates the cluster assignment, instruction scheduling and register allocation steps in a single phase. This unified approach is more effective than traditional approaches based on sequentially performing some (or all) of the three steps, since it allows optimizing the global code generation problem instead of searching for optimal solutions to each individual step. Besides, it avoids the iterative nature of traditional approaches, which require repeated applications of the three steps until a valid solution is found. The proposed framework includes a mechanism to insert spill code on-the-fly and heuristics to evaluate the quality of partial schedules considering simultaneously inter-cluster communications, memory pressure and register pressure. Transformations that allow trading pressure on a type of resource for another resource are also included. We show that the proposed technique outperforms previously proposed techniques. For instance, the average speed-up for the SPECfp95 is 36% for a 4-cluster configuration. |
dc.format.extent | 10 p. |
dc.language.iso | eng |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors |
dc.subject.lcsh | Compilers (Computer programs) |
dc.subject.lcsh | Multiprocessors |
dc.subject.other | Modulo scheduling |
dc.subject.other | Register allocation |
dc.subject.other | Spill code |
dc.subject.other | Cluster assignment |
dc.subject.other | Clustered architectures |
dc.title | A unified modulo scheduling and register allocation technique for clustered processors |
dc.type | Conference report |
dc.subject.lemac | Compiladors (Programes d'ordinador) |
dc.subject.lemac | Multiprocessadors |
dc.contributor.group | Universitat Politècnica de Catalunya. ARCO - Microarquitectura i Compiladors |
dc.identifier.doi | 10.1109/PACT.2001.953298 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=953298 |
dc.rights.access | Open Access |
local.identifier.drac | 2357089 |
dc.description.version | Postprint (published version) |
local.citation.author | Codina, J.M.; Sánchez, F.; González, A. |
local.citation.contributor | International Conference on Parallel Architectures and Compilation Techniques |
local.citation.pubplace | Barcelona |
local.citation.publicationName | 2001 International Conference on Parallel Architectures and Compilation Techniques: 8-12 September 2001 Barcelona, Catalunya, Spain: proceedings |
local.citation.startingPage | 175 |
local.citation.endingPage | 184 |