Mostra el registre d'ítem simple

dc.contributor.authorGibert Codina, Enric
dc.contributor.authorSánchez Navarro, F. Jesús
dc.contributor.authorGonzález Colás, Antonio María
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned2016-11-11T12:35:30Z
dc.date.available2016-11-11T12:35:30Z
dc.date.issued2003
dc.identifier.citationGibert, E., Sánchez, F., González, A. Flexible compiler-managed L0 buffers for clustered VLIW processors. A: Annual IEEE/ACM International Symposium on Microarchitecture. "36th Annual IEEE/ACM International Symposium on Microarchitecture, 2003, MICRO-36: proceedings". San Diego, California: Institute of Electrical and Electronics Engineers (IEEE), 2003, p. 315-325.
dc.identifier.isbn0-7695-2043-X
dc.identifier.urihttp://hdl.handle.net/2117/96552
dc.description.abstractWire delays are a major concern for current and forthcoming processors. One approach to attack this problem is to divide the processor into semi-independent units referred to as clusters. A cluster usually consists of a local register file and a subset of the functional units, while the data cache remains centralized. However, as technology evolves, the latency of such a centralized cache increase leading to an important performance impact. In this paper, we propose to include flexible low-latency buffers in each cluster in order to reduce the performance impact of higher cache latencies. The reduced number of entries in each buffer permits the design of flexible ways to map data from L1 to these buffers. The proposed L0 buffers are managed by the compiler, which is responsible to decide which memory instructions make us of them. Effective instruction scheduling techniques are proposed to generate code that exploits these buffers. Results for the Mediabench benchmark suite show that the performance of a clustered VLIW processor with a unified L1 data cache is improved by 16% when such buffers are used. In addition, the proposed architecture also shows significant advantages over both MultiVLIW processors and clustered processors with a word-interleaved cache, two state-of-the-art designs with a distributed L1 data cache.
dc.format.extent11 p.
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.subjectÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject.lcshCache memory
dc.subject.lcshCompilers (Computer programs)
dc.subject.otherVLIW
dc.subject.otherProcessor scheduling
dc.subject.otherWire
dc.subject.otherDelay
dc.subject.otherFilters
dc.subject.otherEnergy consumption
dc.subject.otherComputational modeling
dc.subject.otherElectronic mail
dc.subject.otherMicroarchitecture
dc.titleFlexible compiler-managed L0 buffers for clustered VLIW processors
dc.typeConference report
dc.subject.lemacMemòria cau
dc.subject.lemacCompiladors (Programes d'ordinador)
dc.contributor.groupUniversitat Politècnica de Catalunya. ARCO - Microarquitectura i Compiladors
dc.identifier.doi10.1109/MICRO.2003.1253205
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://ieeexplore.ieee.org/document/1253205/?reload=true&arnumber=1253205&count=39&index=28
dc.rights.accessOpen Access
local.identifier.drac2453461
dc.description.versionPostprint (published version)
local.citation.authorGibert, E.; Sánchez, F.; González, A.
local.citation.contributorAnnual IEEE/ACM International Symposium on Microarchitecture
local.citation.pubplaceSan Diego, California
local.citation.publicationName36th Annual IEEE/ACM International Symposium on Microarchitecture, 2003, MICRO-36: proceedings
local.citation.startingPage315
local.citation.endingPage325


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple