Hybrid access-specific software cache techniques for the Cell BE architecture

O'Brien, Kathryn; O'Brien, Kevin; González Tallada, Marc; Vujic, Nikola; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Eichenberger, Alexandre E.; Chen, Tong; Sura, Zehra; Zhang, Tao

doi:10.1145/1454115.1454156

dc.contributor.author	O’Brien, Kathryn
dc.contributor.author	O'Brien, Kevin
dc.contributor.author	González Tallada, Marc
dc.contributor.author	Vujic, Nikola
dc.contributor.author	Martorell Bofill, Xavier
dc.contributor.author	Ayguadé Parra, Eduard
dc.contributor.author	Eichenberger, Alexandre E.
dc.contributor.author	Chen, Tong
dc.contributor.author	Sura, Zehra
dc.contributor.author	Zhang, Tao
dc.contributor.other	Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned	2012-04-10T11:49:13Z
dc.date.available	2012-04-10T11:49:13Z
dc.date.created	2008
dc.date.issued	2008
dc.identifier.citation	González, M. [et al.]. Hybrid access-specific software cache techniques for the Cell BE architecture. In: International Conference on Parallel Architectures and Compilation Techniques. "PACT'08. Proceedings of the Seventeenth International Conference on Parallel Architectures and Compilation Techniques". Toronto: Association for Computing Machinery, 2008, p. 292-302.
dc.identifier.isbn	978-1-60558-282-5
dc.identifier.uri	http://hdl.handle.net/2117/15715
dc.description.abstract	Programming difficulty is one of the main impediments to the broad acceptance of multi-core systems with no hardware support for transparent data transfer between local and global memories. A software cache is a robust approach to providing the user with a transparent view of the memory architecture, but this software approach can suffer from poor performance. In this paper, we propose a hierarchical, hybrid software-cache architecture that classifies memory accesses at compile time into two classes, high-locality and irregular. Our approach then steers the memory references toward one of two specific cache structures, each optimized for its respective access pattern. The specific cache structures are designed to enable high-level compiler optimizations to aggressively unroll loops, reorder cache references, and/or transform surrounding loops so as to practically eliminate the software cache overhead in the innermost loop. Performance evaluation indicates that the improvements due to the optimized software-cache structures, combined with the proposed code optimizations, translate into speedup factors of 3.5 to 8.4 compared to a traditional software cache approach. As a result, we demonstrate that the Cell BE processor can be a competitive alternative to a modern server-class multi-core such as the IBM Power5 processor for a set of parallel NAS applications.
dc.format.extent	11 p.
dc.language.iso	eng
dc.publisher	Association for Computing Machinery
dc.subject	UPC subject areas::Computer science::Software engineering
dc.subject.lcsh	Cache memory
dc.subject.lcsh	Compilers (Computer programs)
dc.subject.other	OpenMP
dc.subject.other	Compiler optimizations
dc.subject.other	Local memories
dc.subject.other	Memory classification
dc.subject.other	Software cache
dc.title	Hybrid access-specific software cache techniques for the Cell BE architecture
dc.type	Conference lecture
dc.subject.lemac	Memòria cau
dc.subject.lemac	Compiladors (Programes d'ordinador)
dc.contributor.group	Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi	10.1145/1454115.1454156
dc.description.peerreviewed	Peer Reviewed
dc.rights.access	Restricted access - publisher's policy
local.identifier.drac	2383697
dc.description.version	Postprint (published version)
local.citation.author	González, M.; Vujic, N.; Martorell, X.; Ayguade, E.; Eichenberger, A.; Chen, T.; Sura, Z.; Zhang, T.; O'Brien, K.; O’Brien, K.
local.citation.contributor	International Conference on Parallel Architectures and Compilation Techniques
local.citation.pubplace	Toronto
local.citation.publicationName	PACT'08. Proceedings of the Seventeenth International Conference on Parallel Architectures and Compilation Techniques
local.citation.startingPage	292
local.citation.endingPage	302

Files in this item

Name: p292-gonzalez.pdf
Size: 853.2 KB
Format: PDF
Description: PACT 2008

This item appears in the following collections

Conference papers/communications [784]
Conference papers/communications [1,954]

UPCommons. UPC open knowledge portal
