Sharing the instruction cache among lean cores on an asymmetric CMP for HPC applications

Llevat que s'hi indiqui el contrari, els continguts d'aquesta obra estan subjectes a la llicència de Creative Commons : Reconeixement-NoComercial-SenseObraDerivada 3.0 Espanya

ProjecteCOMPUTACION DE ALTAS PRESTACIONES VII (MINECO-TIN2015-65316-P)
BES-2013-063925 (MINECO-BES-2013-063925)

Abstract

High performance computing (HPC) applications have parallel code sections that must scale to large numbers of cores, which makes them sensitive to serial regions. Current supercomputing systems with heterogeneous or asymmetric CMPs (ACMP) combine few high-performance big cores for serial regions, together with many low-power lean cores for throughput computing. The low requirements of HPC applications in the core front-end lead some designs, such as SMT and GPU cores, to share front-end structures including the instruction cache (I-cache). However, little work exists to analyze the benefit of sharing the I-cache among full cores, which seems compelling as a solution to reduce silicon area and power. This paper analyzes the performance, power and area impact of such a design on an ACMP with one high-performance core and multiple low-power cores. Having identified that multiple cores run the same code during parallel regions, the lean cores share the I-cache with the intent of benefiting from mutual prefetching, without increasing the average access latency. Our exploration of the multiple parameters finds the sweet spot on a wide interconnect to access the shared I-cache and the inclusion of a few line buffers to provide the required bandwidth and latency to sustain performance. The projections with McPAT and a rich set of HPC benchmarks show 11% area savings with a 5% energy reduction at no performance cost.

CitacióMilic, U. [et al.]. Sharing the instruction cache among lean cores on an asymmetric CMP for HPC applications. A: "2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)". Institute of Electrical and Electronics Engineers (IEEE), 2017, p. 3-12.

URIhttp://hdl.handle.net/2117/106841

DOI10.1109/ISPASS.2017.7975265

ISBN978-1-5386-3890-3

Versió de l'editorhttp://ieeexplore.ieee.org/document/7975265/

Col·leccions

Computer Sciences - Ponències/Comunicacions de congressos [574]

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
Sharing the Instruction Cache Among Lean Cores.pdf		367,2Kb	PDF	Visualitza/Obre

UPCommons. Portal del coneixement obert de la UPC

Sharing the instruction cache among lean cores on an asymmetric CMP for HPC applications

Visualitza/Obre

Explora