Mostra el registre d'ítem simple

dc.contributor.authorJiménez, Víctor
dc.contributor.authorBuyuktosunoglu, Alper
dc.contributor.authorBose, Pradip
dc.contributor.authorO'Connell, Francis P.
dc.contributor.authorCazorla Almeida, Francisco Javier
dc.contributor.authorValero Cortés, Mateo
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.contributor.otherBarcelona Supercomputing Center
dc.date.accessioned2016-03-29T08:35:21Z
dc.date.issued2015
dc.identifier.citationJiménez, V., Buyuktosunoglu, A., Bose, P., O'Connell, F., Cazorla, F., Valero, M. Increasing multicore system efficiency through intelligent bandwidth shifting. A: International Symposium on High-Performance Computer Architecture. "2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA 2015): Burlingame, California, USA: 7-11 February 2015". San Francisco Bay Area, California: Institute of Electrical and Electronics Engineers (IEEE), 2015, p. 39-50.
dc.identifier.isbn978-1-4799-8931-7
dc.identifier.urihttp://hdl.handle.net/2117/84776
dc.description.abstractMemory bandwidth is a crucial resource in computing systems. Current CMP/SMT processors have a significant number of cores and they can run many threads concurrently. This large thread count adds high pressure to the memory bus, which demands high bandwidth to service memory requests from the cores. Hardware data prefetching is a well-known technique for hiding memory latency. Due to its speculative nature, however, in some situations prefetching does not effectively work, wasting memory bandwidth and polluting the caches. Data prefetching efficiency depends on the prefetching algorithm. It also depends on the characteristics of the applications running on the system. In this paper we propose an online bandwidth shifting mechanism that dynamically assigns bandwidth to applications according to their prefetch efficiency. This mechanism maximizes the utilization of memory bandwidth, thereby improving system performance and/or reducing memory power consumption. To the best of our knowledge, this solution is the first to not require hardware support. We evaluate the benefits of using our bandwidth shifting mechanism on a real system - the IBM POWER7. We obtain speedups in the order of 10-20% (in one instance, speedup exceeds 1.6X). Our mechanism does not generate a significant degree of unfairness among the applications. In many cases individual thread performance increases by 10-35%, while virtually no thread experiences a slowdown larger than 5%.
dc.description.sponsorshipThis s work has been partially sponsored by Defense Advanced Research Projects Agency (DARPA), Microsystems Technology Office (MTO), under contract no. HR0011-13-C- 0022. The views expressed are those of the authors and do not reflect the official policy or position of the Department of Defense or the U.S. Government. This document is: Approved for Public Release, Distribution Unlimited. This work has also received funding from: the Spanish Ministry of Science and Innovation under grant TIN2012-34557 and the HiPEAC Network of Excellence; and the European Research Council under the European Unions 7th FP (FP/2007- 2013) / ERC GA n. 321253. Additional support was received from a joint study agreement between IBM and BSC (number W1361154).
dc.format.extent12 p.
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.subjectÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject.lcshMemory management (Computer science)
dc.subject.otherMicroprocessor chips
dc.subject.otherMultiprocessing systems
dc.subject.otherStorage management
dc.titleIncreasing multicore system efficiency through intelligent bandwidth shifting
dc.typeConference report
dc.subject.lemacGestió de memòria (Informàtica)
dc.contributor.groupUniversitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi10.1109/HPCA.2015.7056020
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://ieeexplore.ieee.org/xpl/articleDetails.jsp?reload=true&arnumber=7056020
dc.rights.accessRestricted access - publisher's policy
local.identifier.drac15430250
dc.description.versionPostprint (published version)
dc.relation.projectidinfo:eu-repo/grantAgreement/EC/FP7/321253/EU/Riding on Moore's Law/ROMOL
dc.date.lift10000-01-01
local.citation.authorJiménez, V.; Buyuktosunoglu, A.; Bose, P.; O'Connell, F.; Cazorla, F.; Valero, M.
local.citation.contributorInternational Symposium on High-Performance Computer Architecture
local.citation.pubplaceSan Francisco Bay Area, California
local.citation.publicationName2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA 2015): Burlingame, California, USA: 7-11 February 2015
local.citation.startingPage39
local.citation.endingPage50


Fitxers d'aquest items

Imatge en miniatura

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple