dc.contributor.author | Knijnenburg, Peter M.W. |
dc.contributor.author | Ramírez Bellido, Alejandro |
dc.contributor.author | Latorre Salinas, Fernando |
dc.contributor.author | Larriba Pey, Josep |
dc.contributor.author | Valero Cortés, Mateo |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors |
dc.date.accessioned | 2017-09-15T09:02:07Z |
dc.date.available | 2017-09-15T09:02:07Z |
dc.date.issued | 2002 |
dc.identifier.citation | Knijnenburg, P., Ramírez, A., Latorre, F., Larriba, J., Valero, M. Branch classification to control instruction fetch in simultaneous multithreaded architectures. A: International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems. "International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems". Big Island, HI: Institute of Electrical and Electronics Engineers (IEEE), 2002, p. 67-76. |
dc.identifier.isbn | 0-7695-1635-1 |
dc.identifier.uri | http://hdl.handle.net/2117/107650 |
dc.description.abstract | In simultaneous multithreaded architectures many separate threads are running concurrently, sharing processor resources, thereby realizing a high utilization rate of the available hardware. However, this also implies that threads are competing for resources and in many cases this competition can actually degrade overall performance. There are two major causes for this: first, instructions that, because of a long latency data cache miss, cause dependent instructions not to proceed for many cycles thereby wasting space in the instruction queues, and second, execution of instructions that belong to a mispredicted path. Both of these have a harmful effect on throughput and the second moreover wastes energy. in this paper we propose a fetch policy that avoids issuing instructions to the pipeline if we are not confident that the instruction belongs to the correct execution path. In this way, we avoid using resources for instructions that will not contribute to performance. This fetch policy, called agstall, is based on a dynamic branch classification mechanism. Branch instances are classified as either strongly biased or not strongly biased. We consider all strongly biased branches as easy to predict, and we stall the thread on branches that are not strongly biased to avoid mispredicting them. Our results show that agstall achieves similar or better performance than icount, and reduces by up to 86% the number of wrong-path instructions executed. |
dc.format.extent | 10 p. |
dc.language.iso | eng |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors |
dc.subject.lcsh | Cache memory |
dc.subject.lcsh | Parallel processing (Electronic computers) |
dc.subject.other | Parallel architectures |
dc.subject.other | Multi-threading |
dc.subject.other | Cache storage |
dc.title | Branch classification to control instruction fetch in simultaneous multithreaded architectures |
dc.type | Conference report |
dc.subject.lemac | Memòria cau |
dc.subject.lemac | Processament en paral·lel (Ordinadors) |
dc.contributor.group | Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions |
dc.contributor.group | Universitat Politècnica de Catalunya. ARCO - Microarquitectura i Compiladors |
dc.identifier.doi | 10.1109/IWIA.2002.1035020 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | http://ieeexplore.ieee.org/document/1035020/ |
dc.rights.access | Open Access |
local.identifier.drac | 2345581 |
dc.description.version | Postprint (published version) |
local.citation.author | Knijnenburg, P.; Ramírez, A.; Latorre, F.; Larriba, J.; Valero, M. |
local.citation.contributor | International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems |
local.citation.pubplace | Big Island, HI |
local.citation.publicationName | International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems |
local.citation.startingPage | 67 |
local.citation.endingPage | 76 |