Trace-level reuse

González Colás, Antonio María; Tubella Murgadas, Jordi; Molina, Carlos

doi:10.1109/ICPP.1999.797385

dc.contributor.author	González Colás, Antonio María
dc.contributor.author	Tubella Murgadas, Jordi
dc.contributor.author	Molina, Carlos
dc.contributor.other	Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned	2017-06-09T09:54:02Z
dc.date.available	2017-06-09T09:54:02Z
dc.date.issued	1999
dc.identifier.citation	González, A., Tubella, J., Molina, C. Trace-level reuse. A: International Conference on Parallel Processing. "1999 InternationaI Conference on Parallel Processing: 21-24 September 1999, Aizu-Wakamatsu City, Japan: proceedings". Aizu-Wakamatsu: Institute of Electrical and Electronics Engineers (IEEE), 1999, p. 30-37.
dc.identifier.isbn	0-7695-0350-0
dc.identifier.uri	http://hdl.handle.net/2117/105273
dc.description.abstract	Trace-level reuse is based on the observation that some traces (dynamic sequences of instructions) are frequently repeated during the execution of a program, and in many cases, the instructions that make up such traces have the same source operand values. The execution of such traces will obviously produce the same outcome and thus, their execution can be skipped if the processor records the outcome of previous executions. This paper presents an analysis of the performance potential of trace-level reuse and discusses a preliminary realistic implementation. Like instruction-level reuse, trace-level reuse can improve performance by decreasing resource contention and the latency of some instructions. However, we show that trace-level reuse is more effective than instruction-level reuse because the former can avoid fetching the instructions of reused traces. This has two important benefits: it reduces the fetch bandwidth requirements, and it increases the effective instruction window size since these instructions do not occupy window entries. Moreover, trace-level reuse can compute all at once the result of a chain of dependent instructions, which may allow the processor to avoid the serialization caused by data dependences and thus, to potentially exceed the dataflow limit.
dc.format.extent	8 p.
dc.language.iso	eng
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.subject	Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject.lcsh	Microprocessors
dc.subject.lcsh	Parallel processing (Electronic computers)
dc.subject.other	Performance evaluation
dc.subject.other	Multiprocessing systems
dc.subject.other	Instruction sets
dc.subject.other	Resource allocation
dc.title	Trace-level reuse
dc.type	Conference report
dc.subject.lemac	Microprocessadors
dc.subject.lemac	Processament en paral·lel (Ordinadors)
dc.contributor.group	Universitat Politècnica de Catalunya. ARCO - Microarquitectura i Compiladors
dc.identifier.doi	10.1109/ICPP.1999.797385
dc.description.peerreviewed	Peer Reviewed
dc.relation.publisherversion	http://ieeexplore.ieee.org/document/797385/
dc.rights.access	Open Access
local.identifier.drac	2394691
dc.description.version	Postprint (published version)
local.citation.author	González, A.; Tubella, J.; Molina, C.
local.citation.contributor	International Conference on Parallel Processing
local.citation.pubplace	Aizu-Wakamatsu
local.citation.publicationName	1999 InternationaI Conference on Parallel Processing: 21-24 September 1999, Aizu-Wakamatsu City, Japan: proceedings
local.citation.startingPage	30
local.citation.endingPage	37

Fitxers d'aquest items

Nom:: 00797385.pdf
Mida:: 50,54Kb
Format:: PDF

Visualitza/Obre

Aquest ítem apareix a les col·leccions següents

Ponències/Comunicacions de congressos [187]
Ponències/Comunicacions de congressos [1.954]

Mostra el registre d'ítem simple

UPCommons. Portal del coneixement obert de la UPC

Trace-level reuse

Fitxers d'aquest items

Aquest ítem apareix a les col·leccions següents

Explora