The effect of code reordering on branch prediction

Ramírez Bellido, Alejandro; Larriba Pey, Josep; Valero Cortés, Mateo

doi:10.1109/PACT.2000.888343

dc.contributor.author	Ramírez Bellido, Alejandro
dc.contributor.author	Larriba Pey, Josep
dc.contributor.author	Valero Cortés, Mateo
dc.contributor.other	Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned	2017-12-15T09:13:38Z
dc.date.available	2017-12-15T09:13:38Z
dc.date.issued	2000
dc.identifier.citation	Ramírez, A., Larriba, J., Valero, M. The effect of code reordering on branch prediction. A: International Conference on Parallel Architectures and Compilation Techniques. "2000 International Conference on Parallel Architectures and Compilation Techniques: October 15-19, 2000, Philadelphia, Pennsylvania: proceedings". Philadelphia: Institute of Electrical and Electronics Engineers (IEEE), 2000, p. 189-198.
dc.identifier.isbn	0-7695-0622-4
dc.identifier.uri	http://hdl.handle.net/2117/112120
dc.description.abstract	Branch prediction accuracy is a very important factor for superscalar processor performance. The ability to predict the outcome of a branch allows the processor to effectively use a large instruction window, and extract a larger amount of Instruction Level Parallelism (ILP). In this paper we will examine the effect of code layout optimizations on branch prediction accuracy and final processor performance. These code reordering techniques align branches so that they tend to be not taken, achieving better instruction cache performance and increasing the fetch bandwidth. Here we focus on how these optimizations affect both static and dynamic branch prediction. Code reordering mainly increases the number of not taken branches, which benefits simple static predictors, which reach over 80% prediction accuracy with optimized codes. This branch direction change produces no effects on dynamic branch prediction: on the positive side, trades negative interference for neutral or positive interference in the prediction tables; on the negative side, it causes a worse distribution of the Branch History Register (BHR), causing many possible history values to be unused. Our results show that code reordering reduces negative Pattern History Table (PHT) interference, increasing branch prediction accuracy on small branch predictors. For example, a 0.5 KB gshare improves from 91.4% to 93.6%, and a 0.4 KB gskew predictor from 93.5% to 94.4%. For larger history lengths, the large amount of not taken branches can degrade predictor performance on dealiased schemes, like the 16 KB agree predictor which goes from 96.2% to 95.8%. But processor performance not only depends on branch prediction accuracy. Layout optimized codes have much better instruction cache performance, and wider fetch bandwidth. Our results show that when all three factors are considered together; code reordering techniques always improve processor performance. For example, performance still increases by 8% with an agree predictor; which loses prediction accuracy, and it increases by 9% with a gshare predictor, which increases prediction accuracy.
dc.format.extent	10 p.
dc.language.iso	eng
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.subject	Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject.lcsh	Compilers (Computer programs)
dc.subject.lcsh	Parallel processing (Electronic computers)
dc.subject.other	Performance evaluation
dc.subject.other	Parallel architectures
dc.subject.other	Program compilers
dc.title	The effect of code reordering on branch prediction
dc.type	Conference report
dc.subject.lemac	Compiladors (Programes d'ordinador)
dc.subject.lemac	Processament en paral·lel (Ordinadors)
dc.contributor.group	Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi	10.1109/PACT.2000.888343
dc.description.peerreviewed	Peer Reviewed
dc.relation.publisherversion	http://ieeexplore.ieee.org/document/888343/
dc.rights.access	Open Access
local.identifier.drac	2402679
dc.description.version	Postprint (published version)
local.citation.author	Ramírez, A.; Larriba, J.; Valero, M.
local.citation.contributor	International Conference on Parallel Architectures and Compilation Techniques
local.citation.pubplace	Philadelphia
local.citation.publicationName	2000 International Conference on Parallel Architectures and Compilation Techniques: October 15-19, 2000, Philadelphia, Pennsylvania: proceedings
local.citation.startingPage	189
local.citation.endingPage	198

Fitxers d'aquest items

Nom:: 00888343.pdf
Mida:: 981,9Kb
Format:: PDF

Visualitza/Obre

Aquest ítem apareix a les col·leccions següents

Ponències/Comunicacions de congressos [784]
Ponències/Comunicacions de congressos [1.954]

Mostra el registre d'ítem simple

UPCommons. Portal del coneixement obert de la UPC

The effect of code reordering on branch prediction

Fitxers d'aquest items

Aquest ítem apareix a les col·leccions següents

Explora