dc.contributor.author | Quintana Rodríguez, Francisca |
dc.contributor.author | Espasa Sans, Roger |
dc.contributor.author | Valero Cortés, Mateo |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors |
dc.date.accessioned | 2017-10-06T09:10:40Z |
dc.date.available | 2017-10-06T09:10:40Z |
dc.date.issued | 1998 |
dc.identifier.citation | Quintana, F., Espasa, R., Valero, M. A case for merging the ILP and DLP paradigms. A: Euromicro International Conference on Parallel, Distributed, and Network-Based Processing. "Proceedings of the 6th EUROMICRO Workshop on Parallel and Distributed Processing, PDP'98: University of Madrid: January 21-23, 1998, Madrid, Spain". Madrid: Institute of Electrical and Electronics Engineers (IEEE), 1998, p. 217-224. |
dc.identifier.isbn | 0-8186-8332-5 |
dc.identifier.uri | http://hdl.handle.net/2117/108433 |
dc.description.abstract | The goal of this paper is to show that instruction level parallelism (ILP) and data-level parallelism (DLP) can be merged in a single architecture to execute vectorizable code at a performance level that can not be achieved using either paradigm on its own. We will show that the combination of the two techniques yields very high performance at a low cost and a low complexity. We will show that this architecture can reach a performance equivalent to a superscalar processor that sustained 10 instructions per cycle. We will see that the machine exploiting both types of parallelism improves upon the ILP-only machine by factors of 1.5-1.8. We also present a study on the scalability of both paradigms and show that, when we increase resources to reach a 16-issue machine, the advantage of the ILP+DLP machine over the ILP-only machine increases up to 2.0-3.45. While the peak achieved IPC for the ILP machine is 4, the ILP+DLP machine exceeds 10 instructions per cycle. |
dc.format.extent | 8 p. |
dc.language.iso | eng |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors |
dc.subject.lcsh | Parallel processing (Electronic computers) |
dc.subject.lcsh | Computational complexity |
dc.subject.other | Performance evaluation |
dc.subject.other | Parallel architectures |
dc.title | A case for merging the ILP and DLP paradigms |
dc.type | Conference report |
dc.subject.lemac | Processament en paral·lel (Ordinadors) |
dc.subject.lemac | Complexitat computacional |
dc.contributor.group | Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions |
dc.identifier.doi | 10.1109/EMPDP.1998.647201 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | http://ieeexplore.ieee.org/document/647201/ |
dc.rights.access | Open Access |
local.identifier.drac | 2420697 |
dc.description.version | Postprint (published version) |
local.citation.author | Quintana, F.; Espasa, R.; Valero, M. |
local.citation.contributor | Euromicro International Conference on Parallel, Distributed, and Network-Based Processing |
local.citation.pubplace | Madrid |
local.citation.publicationName | Proceedings of the 6th EUROMICRO Workshop on Parallel and Distributed Processing, PDP'98: University of Madrid: January 21-23, 1998, Madrid, Spain |
local.citation.startingPage | 217 |
local.citation.endingPage | 224 |