Optimizing resource utilization with software-based temporal multi-threading (sTMT)
Visualitza/Obre
HIPC2012.pdf (713,2Kb) (Accés restringit)
Sol·licita una còpia a l'autor
Què és aquest botó?
Aquest botó permet demanar una còpia d'un document restringit a l'autor. Es mostra quan:
- Disposem del correu electrònic de l'autor
- El document té una mida inferior a 20 Mb
- Es tracta d'un document d'accés restringit per decisió de l'autor o d'un document d'accés restringit per política de l'editorial
Estadístiques de LA Referencia / Recolecta
Inclou dades d'ús des de 2022
Cita com:
hdl:2117/18335
Tipus de documentText en actes de congrés
Data publicació2013
Condicions d'accésAccés restringit per política de l'editorial
Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i
industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva
reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets
Abstract
Compute and memory access units are two of the
most important resources to appropriately manage in current
and future multi–/many–core architectures. Memory bandwidth
and computational capacity need to be exploited in a combined
way to achieve the best system performance. Coarse–grain multi–
threading, also known as temporal multi–threading (TMT), is a
well known technique that improves overall resource utilization
by time–multiplexing the execution of a reduced number of
hardware threads that are switched in case of a high–latency
event, such as a memory miss. Hence, the processor does not stall
on memory misses and the number of in–fly memory operations
is increased, improving the overall processor resource utilization.
In this paper, we propose a software–based implementation
of TMT that supports and unbounded number of threads
and enables a flexible combination of multiple computational
kernels. Our TMT implementation is based on micro–threads
that combine fast cooperative and preemptive context switches
to overcome some intrinsic limitations of current TMT hardware
implementations, such as the reduced and fixed number of
hardware threads available. Our proposal is demonstrated with
an implementation on the Cell/B.E. which is evaluated using heterogeneous
mixes of memory–/CPU–bound kernels. Experimental
results show how the proposed technique reduce the execution
time of several benchmarks by up to 78%.
CitacióBeltran, V.; Ayguade, E. Optimizing resource utilization with software-based temporal multi-threading (sTMT). A: International Conference on High Performance Computing. "19th International Conference on High Performance Computing". Pune: 2013, p. 1-10.
ISBN978-1-4673-2371-0/12
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
HIPC2012.pdf | 713,2Kb | Accés restringit |