On the instrumentation of OpenMP and OmpSs Tasking constructs
Document type: Conference proceedings text
Access conditions: Open access
Parallelism has become increasingly commonplace with the advent of multicore processors. Although different parallel programming models have arisen to exploit the computing capabilities of such processors, developing applications that benefit from them may not be easy. Worse, the performance achieved by the parallel version of the application may not be what the developer expected, as a result of questionable utilization of the resources offered by the processor. In this paper we present a fruitful synergy between a shared-memory parallel compiler and runtime and a performance extraction library. The objective of this work is not only to shorten the performance analysis life cycle when parallelizing an application, but also to extend the analysis experience of the parallel application by incorporating data that is known only on the compiler and runtime side. Additionally, we present performance results obtained by executing instrumented applications and evaluate the overhead of the instrumentation.
Citation: Servat, H. [et al.]. On the instrumentation of OpenMP and OmpSs Tasking constructs. In: Workshop on Productivity and Performance. "Euro-Par 2012: Parallel Processing Workshops: BDMC, CGWS, HeteroPar, HiBB, OMHI, Paraphrase, PROPER, Resilience, UCHPC, VHPC, Rhodes Islands, Greece, August 27-31, 2012: revised selected papers". Rhodes Island: Springer, 2012, p. 414-428.
Publisher's version: http://link.springer.com/chapter/10.1007/978-3-642-36949-0_47