Mostra el registre d'ítem simple

dc.contributor.authorBeltran Querol, Vicenç
dc.contributor.authorAyguadé Parra, Eduard
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned2013-03-15T09:57:14Z
dc.date.created2013
dc.date.issued2013
dc.identifier.citationBeltran, V.; Ayguade, E. Optimizing resource utilization with software-based temporal multi-threading (sTMT). A: International Conference on High Performance Computing. "19th International Conference on High Performance Computing". Pune: 2013, p. 1-10.
dc.identifier.isbn978-1-4673-2371-0/12
dc.identifier.urihttp://hdl.handle.net/2117/18335
dc.description.abstractCompute and memory access units are two of the most important resources to appropriately manage in current and future multi–/many–core architectures. Memory bandwidth and computational capacity need to be exploited in a combined way to achieve the best system performance. Coarse–grain multi– threading, also known as temporal multi–threading (TMT), is a well known technique that improves overall resource utilization by time–multiplexing the execution of a reduced number of hardware threads that are switched in case of a high–latency event, such as a memory miss. Hence, the processor does not stall on memory misses and the number of in–fly memory operations is increased, improving the overall processor resource utilization. In this paper, we propose a software–based implementation of TMT that supports and unbounded number of threads and enables a flexible combination of multiple computational kernels. Our TMT implementation is based on micro–threads that combine fast cooperative and preemptive context switches to overcome some intrinsic limitations of current TMT hardware implementations, such as the reduced and fixed number of hardware threads available. Our proposal is demonstrated with an implementation on the Cell/B.E. which is evaluated using heterogeneous mixes of memory–/CPU–bound kernels. Experimental results show how the proposed technique reduce the execution time of several benchmarks by up to 78%.
dc.format.extent10 p.
dc.language.isoeng
dc.subjectÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject.lcshComputer architecture
dc.subject.lcshHigh performance computing
dc.titleOptimizing resource utilization with software-based temporal multi-threading (sTMT)
dc.typeConference report
dc.subject.lemacArquitectura d'ordinadors
dc.subject.lemacCàlcul intensiu (Informàtica)
dc.contributor.groupUniversitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.description.peerreviewedPeer Reviewed
dc.rights.accessRestricted access - publisher's policy
local.identifier.drac11622554
dc.description.versionPostprint (published version)
dc.date.lift10000-01-01
local.citation.authorBeltran, V.; Ayguade, E.
local.citation.contributorInternational Conference on High Performance Computing
local.citation.pubplacePune
local.citation.publicationName19th International Conference on High Performance Computing
local.citation.startingPage1
local.citation.endingPage10


Fitxers d'aquest items

Imatge en miniatura

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple