Effective instruction scheduling techniques for an interleaved cache clustered VLIW processor
| dc.contributor.author | Gibert Codina, Enric |
| dc.contributor.author | Sánchez Navarro, Jesús |
| dc.contributor.author | González Colás, Antonio María |
| dc.contributor.group | Universitat Politècnica de Catalunya. ARCO - Microarquitectura i Compiladors |
| dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors |
| dc.date.accessioned | 2017-02-22T08:29:22Z |
| dc.date.available | 2017-02-22T08:29:22Z |
| dc.date.issued | 2002 |
| dc.description.abstract | Clustering is a common technique to overcome the wire delay problem incurred by the evolution of technology. Fully-distributed architectures, where the register file, the functional units and the data cache are partitioned, are particularly effective to deal with these constraints and besides they are very scalable. In this paper effective instruction scheduling techniques for a clustered VLIW processor with a word-interleaved cache are proposed Such scheduling techniques rely on: (i) loop unrolling and variable alignment to increase the percentage of local accesses, (ii) a latency assignment process to schedule memory operations with an appropriate latency and (iii) different heuristics to assign instructions to clusters. In particular, the number of local accesses is increased by more than 25% if these techniques are used and the ratio of stall time over compute time is small. Next, the main source of remote accesses and stall time is investigated. Stall time is mainly due to remote hits, and Attraction Buffers are used to increase local accesses and reduce stall time. Stall time is reduced by 29% and 34% depending on the scheduling heuristic. IPC results for a word-interleaved cache clustered VLIW processor are similar to those of the multiVLIW (a cache-coherent clustered processor with a more complex hardware design), and are 10% and 5% better (depending on the scheduling heuristic) than the IPC for a clustered processor with a unified cache. |
| dc.description.peerreviewed | Peer Reviewed |
| dc.description.version | Postprint (published version) |
| dc.format.extent | 11 p. |
| dc.identifier.citation | Gibert, E., Sánchez, J., González, A. Effective instruction scheduling techniques for an interleaved cache clustered VLIW processor. A: Annual IEEE/ACM International Symposium on Microarchitecture. "35th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-35): 18-22 November 2002, Istanbul, Turkey: proceedings". Istambul: Institute of Electrical and Electronics Engineers (IEEE), 2002, p. 123-133. |
| dc.identifier.doi | 10.1109/MICRO.2002.1176244 |
| dc.identifier.isbn | 0-7695-1859-1 |
| dc.identifier.uri | https://hdl.handle.net/2117/101363 |
| dc.language.iso | eng |
| dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) |
| dc.relation.publisherversion | http://ieeexplore.ieee.org/document/1176244/ |
| dc.rights.access | Open Access |
| dc.subject | Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors |
| dc.subject.lcsh | Parallel processing (Electronic computers) |
| dc.subject.lcsh | Cache memory |
| dc.subject.lemac | Processament en paral·lel (Ordinadors) |
| dc.subject.lemac | Memòria cau |
| dc.subject.other | Processor scheduling |
| dc.subject.other | VLIW |
| dc.subject.other | Delay |
| dc.subject.other | Computer architecture |
| dc.subject.other | Memory architecture |
| dc.subject.other | Protocols |
| dc.subject.other | Interleaved codes |
| dc.subject.other | Computer aided instruction |
| dc.subject.other | Argon |
| dc.subject.other | Microarchitecture |
| dc.title | Effective instruction scheduling techniques for an interleaved cache clustered VLIW processor |
| dc.type | Conference report |
| dspace.entity.type | Publication |
| local.citation.author | Gibert, E.; Sánchez, J.; González, A. |
| local.citation.contributor | Annual IEEE/ACM International Symposium on Microarchitecture |
| local.citation.endingPage | 133 |
| local.citation.publicationName | 35th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-35): 18-22 November 2002, Istanbul, Turkey: proceedings |
| local.citation.pubplace | Istambul |
| local.citation.startingPage | 123 |
| local.identifier.drac | 2396013 |
Fitxers
Paquet original
1 - 1 de 1



