Mostra el registre d'ítem simple

dc.contributor.authorAlastruey, Jesús
dc.contributor.authorMonreal Arnal, Teresa
dc.contributor.authorCazorla Almeida, Francisco Javier
dc.contributor.authorViñals Yufera, Víctor
dc.contributor.authorValero Cortés, Mateo
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned2011-09-21T11:11:02Z
dc.date.available2011-09-21T11:11:02Z
dc.date.created2008
dc.date.issued2008
dc.identifier.citationAlastruey, J.; Monreal, T.; Cazorla, F.; Viñals, V.; Valero, M. Selection of the register file size and the resource policy on SMT processors. A: International Symposium on Computer Architecture and High Performance Computing. "20th International Symposium on Computer Architecture and High Performance Computing: Campo Grande, MS, Brazil, 29 October 2008-1 November 2008: proceedings". IEEE Computer Society, 2008, p. 63-70.
dc.identifier.isbn9780769534237
dc.identifier.urihttp://hdl.handle.net/2117/13283
dc.description.abstractThe performance impact of the Physical Register File (PRF) size on Simultaneous Multithreading processors has not been extensively studied in spite of being a critical shared resource. In this paper we analyze the effect on performance of the PRF size for a broad set of resource allocation policies (Icount, Stall, Flush, Flush++, Static, Dcra and Hill-climbing) and evaluate them under two metrics: instructions per second (IPS) for throughput and harmonic mean of weighted IPCs (Hmean-wIPC) for fairness. We have found that resource allocation policy and PRF size should be considered together in order to obtain the best score in the proposed metrics. For instance, for the analyzed 2 and 4-threaded SPEC CPU2000 workloads, small PRFs are best managed by Flush, whereas for larger PRFs, Hill-climbing and Static lead to the best values for the throughput and fairness metrics, respectively. The second contribution of this work is a simple procedure that, for a given resource allocation policy, selects the PRF size that maximizes IPS and obtains for HmeanwIPC a value close to its maximum. According to our results, Hill-climbing with a 320-entry PRF achieves the best figures for 2-threaded workloads. When executing 4-threaded workloads, Hill-Climbing with a 384-entry PRF achieves the best throughput whereas Static obtains the best throughput-fairness balance.
dc.format.extent8 p.
dc.language.isoeng
dc.publisherIEEE Computer Society
dc.subjectÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject.lcshSimultaneous multithreading processors
dc.subject.otherFetch policies
dc.subject.otherRegister file
dc.subject.otherResource allocation
dc.titleSelection of the register file size and the resource policy on SMT processors
dc.typeConference lecture
dc.subject.lemacMultiprocessadors
dc.contributor.groupUniversitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi10.1109/SBAC-PAD.2008.17
dc.relation.publisherversionhttp://ieeexplore.ieee.org/document/4685729/
dc.rights.accessOpen Access
local.identifier.drac2433373
dc.description.versionPostprint (published version)
local.citation.authorAlastruey, J.; Monreal, T.; Cazorla, F.; Viñals, V.; Valero, M.
local.citation.contributorInternational Symposium on Computer Architecture and High Performance Computing
local.citation.pubplaceCampo Grande
local.citation.publicationName20th International Symposium on Computer Architecture and High Performance Computing: Campo Grande, MS, Brazil, 29 October 2008-1 November 2008: proceedings
local.citation.startingPage63
local.citation.endingPage70


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple