Mostra el registre d'ítem simple
Exploiting pseudo-schedules to guide data dependence graph partitioning
dc.contributor.author | Aleta Ortega, Alexandre |
dc.contributor.author | Codina Viñas, Josep M. |
dc.contributor.author | Sánchez Navarro, F. Jesús |
dc.contributor.author | González Colás, Antonio María |
dc.contributor.author | David, Kaeli |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors |
dc.date.accessioned | 2017-03-15T12:39:02Z |
dc.date.available | 2017-03-15T12:39:02Z |
dc.date.issued | 2002 |
dc.identifier.citation | Aleta, A., Codina, J.M., Sánchez, F., González, A., David, K. Exploiting pseudo-schedules to guide data dependence graph partitioning. A: International Conference on Parallel Architectues and Compilation Techniques. "2002 International Conference on Parallel Architectures and Compilation Techniques, PACT 2002: 22-25 September 2002, Charlottesville, Virginia, USA: proceedings". Charlottesville, Virginia: Institute of Electrical and Electronics Engineers (IEEE), 2002, p. 281-290. |
dc.identifier.isbn | 0-7695-1620-3 |
dc.identifier.uri | http://hdl.handle.net/2117/102511 |
dc.description.abstract | This paper presents a new modulo scheduling algorithm for clustered microarchitectures. The main feature of the proposed scheme is that the assignment of instructions to clusters is done by means of graph partitioning algorithms that are guided by a pseudo-scheduler. This pseudo-scheduler is a simplified version of the full instruction scheduler and estimates key constraints that would be encountered in the final schedule. The final scheduling process is bi-directional and includes on-the-fly spill code generation. The proposed scheme is evaluated against previous scheduling approaches using the SPECfp95 benchmark suite. Our modeling results show that better schedules are obtained for most programs across a range of different architectures. For a 4-cluster VLIW architecture with 32 registers and a 2-cycle inter-cluster communication delay we obtain an average speedup of 38.5%. |
dc.format.extent | 10 p. |
dc.language.iso | eng |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors |
dc.subject.lcsh | Embedded computer systems |
dc.subject.lcsh | Microprocessors |
dc.subject.other | Processor scheduling |
dc.subject.other | Microarchitecture |
dc.subject.other | VLIW |
dc.subject.other | Registers |
dc.subject.other | Computer architecture |
dc.subject.other | Delay |
dc.subject.other | Scheduling algorithm |
dc.subject.other | Floating-point arithmetic |
dc.subject.other | Clustering algorithms |
dc.subject.other | Partitioning algorithms |
dc.title | Exploiting pseudo-schedules to guide data dependence graph partitioning |
dc.type | Conference report |
dc.subject.lemac | Ordinadors immersos, Sistemes d' |
dc.subject.lemac | Microprocessadors |
dc.contributor.group | Universitat Politècnica de Catalunya. ARCO - Microarquitectura i Compiladors |
dc.identifier.doi | 10.1109/PACT.2002.1106027 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | http://ieeexplore.ieee.org/document/1106027/ |
dc.rights.access | Open Access |
local.identifier.drac | 2395181 |
dc.description.version | Postprint (published version) |
local.citation.author | Aleta, A.; Codina, J.M.; Sánchez, F.; González, A.; David, K. |
local.citation.contributor | International Conference on Parallel Architectues and Compilation Techniques |
local.citation.pubplace | Charlottesville, Virginia |
local.citation.publicationName | 2002 International Conference on Parallel Architectures and Compilation Techniques, PACT 2002: 22-25 September 2002, Charlottesville, Virginia, USA: proceedings |
local.citation.startingPage | 281 |
local.citation.endingPage | 290 |