Mostra el registre d'ítem simple
Mapping parallel loops on multicore systems
dc.contributor.author | Tabik, Siham |
dc.contributor.author | Romero, Felipe |
dc.contributor.author | Utrera Iglesias, Gladys Miriam |
dc.contributor.author | Plata, Oscar |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors |
dc.date.accessioned | 2012-06-21T10:38:35Z |
dc.date.available | 2012-06-21T10:38:35Z |
dc.date.created | 2010 |
dc.date.issued | 2010 |
dc.identifier.citation | Tabik, Siham [et al.]. Mapping parallel loops on multicore systems. A: Workshop on Compilers for Parallel Computing. "15th Workshop on Compilers for Parallel Computing". Vienna: 2010. |
dc.identifier.uri | http://hdl.handle.net/2117/16116 |
dc.description.abstract | The compute nodes in contemporary HPC systems contain one or more multicore processors. As a result, these nodes constitute a shared-memory multiprocessor, often combining CMP and SMT concurrency technologies. This configuration introduces different levels of sharing in the cache hierarchy, resulting in non-uniform data sharing overheads. In this paper we analyze the data-sharing patterns that exhibit a real multithreaded application when executing on a multicore system, with emphasis in the use of the shared last level cache (LLC) for the concurrent threads. As a consequence of this study, we explore the loop mapping problem in such systems with the aim of optimizing the shared use of the the LLC by all parallel threads. We propose a three-phase loop mapping strategy that deals with workload imbalances, minimizes cache sharing interferences, and maximizes intra-core and inter-core data reuse in the cache hierarchy. Preliminary results show some benefits of our approach. However, this is a work in progress and much more research is being done. |
dc.format.extent | 13 p. |
dc.language.iso | eng |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Spain |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors |
dc.subject.lcsh | High performance computing |
dc.subject.lcsh | Multiprocessors |
dc.title | Mapping parallel loops on multicore systems |
dc.type | Conference report |
dc.subject.lemac | Càlcul intensiu (Informàtica) |
dc.subject.lemac | Multiprocessadors |
dc.contributor.group | Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions |
dc.relation.publisherversion | http://www.complang.tuwien.ac.at/cpc10/ |
dc.rights.access | Open Access |
local.identifier.drac | 5305687 |
dc.description.version | Postprint (author’s final draft) |
local.citation.author | Tabik, Siham; Romero, F.; Utrera, G.; Plata, Oscar |
local.citation.contributor | Workshop on Compilers for Parallel Computing |
local.citation.pubplace | Vienna |
local.citation.publicationName | 15th Workshop on Compilers for Parallel Computing |