Fetching instruction streams

Ramírez Bellido, Alejandro; Santana Jaria, Oliverio J.; Larriba Pey, Josep; Valero Cortés, Mateo

doi:10.1109/MICRO.2002.1176264

dc.contributor.author	Ramírez Bellido, Alejandro
dc.contributor.author	Santana Jaria, Oliverio J.
dc.contributor.author	Larriba Pey, Josep
dc.contributor.author	Valero Cortés, Mateo
dc.contributor.other	Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned	2017-05-04T10:25:45Z
dc.date.available	2017-05-04T10:25:45Z
dc.date.issued	2002
dc.identifier.citation	Ramírez, A., Santana, O., Larriba, J., Valero, M. Fetching instruction streams. A: Annual IEEE/ACM International Symposium on Microarchitecture. "35th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-35): 18-22 November 2002, Istanbul, Turkey: proceedings". Istanbul: Institute of Electrical and Electronics Engineers (IEEE), 2002, p. 371-382.
dc.identifier.isbn	0-7695-1859-1
dc.identifier.uri	http://hdl.handle.net/2117/104050
dc.description.abstract	Fetch performance is a very important factor because it effectively limits the overall processor performance. However, there is little performance advantage in increasing front-end performance beyond what the back-end can consume. For each processor design, the target is to build the best possible fetch engine for the required performance level. A fetch engine will be better if it provides better performance, but also if it takes fewer resources, requires less chip area, or consumes less power. In this paper we propose a novel fetch architecture based on the execution of long streams of sequential instructions, taking maximum advantage of code layout optimizations. We describe our architecture in detail, and show that it requires less complexity and resources than other high performance fetch architectures like the trace cache, while providing a high fetch performance suitable for wide-issue superscalar processors. Our results show that using our fetch architecture and code layout optimizations obtains 10% higher performance than the EV8 fetch architecture, and 4% higher than the FTB architecture using state-of-the-art branch predictors, while being only 1.5% slower than the trace cache. Even in the absence of code layout optimizations, fetching instruction streams is still 10% faster than the EV8, and only 4% slower than the trace cache. Fetching instruction streams effectively exploits the special characteristics of layout optimized codes to provide a high fetch performance, close to that of a trace cache, but has a much lower cost and complexity, similar to that of a basic block architecture.
dc.format.extent	12 p.
dc.language.iso	eng
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.subject	UPC subject areas::Computer science::Computer architecture
dc.subject.lcsh	Microprocessors -- Design and construction
dc.subject.other	Performance evaluation
dc.subject.other	Computer architecture
dc.title	Fetching instruction streams
dc.type	Conference report
dc.subject.lemac	Microprocessors -- Design and construction
dc.contributor.group	Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi	10.1109/MICRO.2002.1176264
dc.description.peerreviewed	Peer Reviewed
dc.relation.publisherversion	http://ieeexplore.ieee.org/document/1176264/
dc.rights.access	Open Access
local.identifier.drac	2410319
dc.description.version	Postprint (published version)
local.citation.author	Ramírez, A.; Santana, O.; Larriba, J.; Valero, M.
local.citation.contributor	Annual IEEE/ACM International Symposium on Microarchitecture
local.citation.pubplace	Istanbul
local.citation.publicationName	35th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-35): 18-22 November 2002, Istanbul, Turkey: proceedings
local.citation.startingPage	371
local.citation.endingPage	382

Files in this item

Name: 01176264.pdf
Size: 268.4 KB
Format: PDF

This item appears in the following collections

Conference papers/presentations [784]
Conference papers/presentations [1,955]

UPCommons. UPC's open knowledge portal
