Flexible compiler-managed L0 buffers for clustered VLIW processors

Gibert Codina, Enric; Sánchez Navarro, F. Jesús; González Colás, Antonio María

doi:10.1109/MICRO.2003.1253205

dc.contributor.author	Gibert Codina, Enric
dc.contributor.author	Sánchez Navarro, F. Jesús
dc.contributor.author	González Colás, Antonio María
dc.contributor.other	Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned	2016-11-11T12:35:30Z
dc.date.available	2016-11-11T12:35:30Z
dc.date.issued	2003
dc.identifier.citation	Gibert, E., Sánchez, F., González, A. Flexible compiler-managed L0 buffers for clustered VLIW processors. A: Annual IEEE/ACM International Symposium on Microarchitecture. "36th Annual IEEE/ACM International Symposium on Microarchitecture, 2003, MICRO-36: proceedings". San Diego, California: Institute of Electrical and Electronics Engineers (IEEE), 2003, p. 315-325.
dc.identifier.isbn	0-7695-2043-X
dc.identifier.uri	http://hdl.handle.net/2117/96552
dc.description.abstract	Wire delays are a major concern for current and forthcoming processors. One approach to attack this problem is to divide the processor into semi-independent units referred to as clusters. A cluster usually consists of a local register file and a subset of the functional units, while the data cache remains centralized. However, as technology evolves, the latency of such a centralized cache increase leading to an important performance impact. In this paper, we propose to include flexible low-latency buffers in each cluster in order to reduce the performance impact of higher cache latencies. The reduced number of entries in each buffer permits the design of flexible ways to map data from L1 to these buffers. The proposed L0 buffers are managed by the compiler, which is responsible to decide which memory instructions make us of them. Effective instruction scheduling techniques are proposed to generate code that exploits these buffers. Results for the Mediabench benchmark suite show that the performance of a clustered VLIW processor with a unified L1 data cache is improved by 16% when such buffers are used. In addition, the proposed architecture also shows significant advantages over both MultiVLIW processors and clustered processors with a word-interleaved cache, two state-of-the-art designs with a distributed L1 data cache.
dc.format.extent	11 p.
dc.language.iso	eng
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.subject	Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject.lcsh	Cache memory
dc.subject.lcsh	Compilers (Computer programs)
dc.subject.other	VLIW
dc.subject.other	Processor scheduling
dc.subject.other	Wire
dc.subject.other	Delay
dc.subject.other	Filters
dc.subject.other	Energy consumption
dc.subject.other	Computational modeling
dc.subject.other	Electronic mail
dc.subject.other	Microarchitecture
dc.title	Flexible compiler-managed L0 buffers for clustered VLIW processors
dc.type	Conference report
dc.subject.lemac	Memòria cau
dc.subject.lemac	Compiladors (Programes d'ordinador)
dc.contributor.group	Universitat Politècnica de Catalunya. ARCO - Microarquitectura i Compiladors
dc.identifier.doi	10.1109/MICRO.2003.1253205
dc.description.peerreviewed	Peer Reviewed
dc.relation.publisherversion	http://ieeexplore.ieee.org/document/1253205/?reload=true&arnumber=1253205&count=39&index=28
dc.rights.access	Open Access
local.identifier.drac	2453461
dc.description.version	Postprint (published version)
local.citation.author	Gibert, E.; Sánchez, F.; González, A.
local.citation.contributor	Annual IEEE/ACM International Symposium on Microarchitecture
local.citation.pubplace	San Diego, California
local.citation.publicationName	36th Annual IEEE/ACM International Symposium on Microarchitecture, 2003, MICRO-36: proceedings
local.citation.startingPage	315
local.citation.endingPage	325

Fitxers d'aquest items

Nom:: 01253205.pdf
Mida:: 295,8Kb
Format:: PDF

Visualitza/Obre

Aquest ítem apareix a les col·leccions següents

Ponències/Comunicacions de congressos [187]
Ponències/Comunicacions de congressos [1.954]

Mostra el registre d'ítem simple

UPCommons. Portal del coneixement obert de la UPC

Flexible compiler-managed L0 buffers for clustered VLIW processors

Fitxers d'aquest items

Aquest ítem apareix a les col·leccions següents

Explora