Increasing multicore system efficiency through intelligent bandwidth shifting

Jiménez, Víctor; Buyuktosunoglu, Alper; Bose, Pradip; O'Connell, Francis P.; Cazorla Almeida, Francisco Javier; Valero Cortés, Mateo

doi:10.1109/HPCA.2015.7056020

dc.contributor.author	Jiménez, Víctor
dc.contributor.author	Buyuktosunoglu, Alper
dc.contributor.author	Bose, Pradip
dc.contributor.author	O'Connell, Francis P.
dc.contributor.author	Cazorla Almeida, Francisco Javier
dc.contributor.author	Valero Cortés, Mateo
dc.contributor.other	Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.contributor.other	Barcelona Supercomputing Center
dc.date.accessioned	2016-03-29T08:35:21Z
dc.date.issued	2015
dc.identifier.citation	Jiménez, V., Buyuktosunoglu, A., Bose, P., O'Connell, F., Cazorla, F., Valero, M. Increasing multicore system efficiency through intelligent bandwidth shifting. A: International Symposium on High-Performance Computer Architecture. "2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA 2015): Burlingame, California, USA: 7-11 February 2015". San Francisco Bay Area, California: Institute of Electrical and Electronics Engineers (IEEE), 2015, p. 39-50.
dc.identifier.isbn	978-1-4799-8931-7
dc.identifier.uri	http://hdl.handle.net/2117/84776
dc.description.abstract	Memory bandwidth is a crucial resource in computing systems. Current CMP/SMT processors have a significant number of cores and they can run many threads concurrently. This large thread count adds high pressure to the memory bus, which demands high bandwidth to service memory requests from the cores. Hardware data prefetching is a well-known technique for hiding memory latency. Due to its speculative nature, however, in some situations prefetching does not effectively work, wasting memory bandwidth and polluting the caches. Data prefetching efficiency depends on the prefetching algorithm. It also depends on the characteristics of the applications running on the system. In this paper we propose an online bandwidth shifting mechanism that dynamically assigns bandwidth to applications according to their prefetch efficiency. This mechanism maximizes the utilization of memory bandwidth, thereby improving system performance and/or reducing memory power consumption. To the best of our knowledge, this solution is the first to not require hardware support. We evaluate the benefits of using our bandwidth shifting mechanism on a real system - the IBM POWER7. We obtain speedups in the order of 10-20% (in one instance, speedup exceeds 1.6X). Our mechanism does not generate a significant degree of unfairness among the applications. In many cases individual thread performance increases by 10-35%, while virtually no thread experiences a slowdown larger than 5%.
dc.description.sponsorship	This s work has been partially sponsored by Defense Advanced Research Projects Agency (DARPA), Microsystems Technology Office (MTO), under contract no. HR0011-13-C- 0022. The views expressed are those of the authors and do not reflect the official policy or position of the Department of Defense or the U.S. Government. This document is: Approved for Public Release, Distribution Unlimited. This work has also received funding from: the Spanish Ministry of Science and Innovation under grant TIN2012-34557 and the HiPEAC Network of Excellence; and the European Research Council under the European Unions 7th FP (FP/2007- 2013) / ERC GA n. 321253. Additional support was received from a joint study agreement between IBM and BSC (number W1361154).
dc.format.extent	12 p.
dc.language.iso	eng
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.subject	Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject.lcsh	Memory management (Computer science)
dc.subject.other	Microprocessor chips
dc.subject.other	Multiprocessing systems
dc.subject.other	Storage management
dc.title	Increasing multicore system efficiency through intelligent bandwidth shifting
dc.type	Conference report
dc.subject.lemac	Gestió de memòria (Informàtica)
dc.contributor.group	Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi	10.1109/HPCA.2015.7056020
dc.description.peerreviewed	Peer Reviewed
dc.relation.publisherversion	http://ieeexplore.ieee.org/xpl/articleDetails.jsp?reload=true&arnumber=7056020
dc.rights.access	Restricted access - publisher's policy
local.identifier.drac	15430250
dc.description.version	Postprint (published version)
dc.relation.projectid	info:eu-repo/grantAgreement/EC/FP7/321253/EU/Riding on Moore's Law/ROMOL
dc.date.lift	10000-01-01
local.citation.author	Jiménez, V.; Buyuktosunoglu, A.; Bose, P.; O'Connell, F.; Cazorla, F.; Valero, M.
local.citation.contributor	International Symposium on High-Performance Computer Architecture
local.citation.pubplace	San Francisco Bay Area, California
local.citation.publicationName	2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA 2015): Burlingame, California, USA: 7-11 February 2015
local.citation.startingPage	39
local.citation.endingPage	50

Fitxers d'aquest items

Nom:: 07056020.pdf
Mida:: 172,1Kb
Format:: PDF

Visualitza/Obre

Aquest ítem apareix a les col·leccions següents

Ponències/Comunicacions de congressos [574]
Ponències/Comunicacions de congressos [784]
Ponències/Comunicacions de congressos [1.954]

Mostra el registre d'ítem simple

UPCommons. Portal del coneixement obert de la UPC

Increasing multicore system efficiency through intelligent bandwidth shifting

Fitxers d'aquest items

Aquest ítem apareix a les col·leccions següents

Explora