Unveiling internal evolution of parallel application computation phases
Visualitza/Obre
Unveiling Internal Evolution of Parallel Application Computation Phases .pdf (1,292Mb) (Accés restringit)
Sol·licita una còpia a l'autor
Què és aquest botó?
Aquest botó permet demanar una còpia d'un document restringit a l'autor. Es mostra quan:
- Disposem del correu electrònic de l'autor
- El document té una mida inferior a 20 Mb
- Es tracta d'un document d'accés restringit per decisió de l'autor o d'un document d'accés restringit per política de l'editorial
Cita com:
hdl:2117/24546
Tipus de documentText en actes de congrés
Data publicació2011
EditorInstitute of Electrical and Electronics Engineers (IEEE)
Condicions d'accésAccés restringit per política de l'editorial
Llevat que s'hi indiqui el contrari, els
continguts d'aquesta obra estan subjectes a la llicència de Creative Commons
:
Reconeixement-NoComercial-SenseObraDerivada 3.0 Espanya
Abstract
As access to supercomputing resources is becoming more and more commonplace, performance analysis tools are gaining importance in order to decrease the gap between the application performance and the supercomputers' peak performance. Performance analysis tools allow the analyst to understand the idiosyncrasies of an application in order to improve it. However, these tools require monitoring regions of the application to provide information to the analysts, leaving non-monitored regions of code unknown, which may result in lack of understanding of important regions of the application. In this paper we describe an automated methodology that reports very detailed application insights and improves the analysis experience of performance tools based on traces. We apply this methodology to three production applications and provide suggestions on how to improve their performance. Our methodology uses computation burst clustering and a mechanism called folding. While clustering automatically detects application structure, folding combines instrumentation and sampling to augment the performance analysis details. Folding provides fine grain performance information from coarse grain sampling on iterative applications. Folding results closely resemble the performance data gathered from fine grain sampling with an absolute mean difference less than 5% without overhead of fine grain.
CitacióServat, H. [et al.]. Unveiling internal evolution of parallel application computation phases. A: International Conference on Parallel Processing. "International Conference on Parallel Processing (ICPP), 2011: 13-16 Sept. 2011, Taipei City, Taiwan: proceedings". Taipei: Institute of Electrical and Electronics Engineers (IEEE), 2011, p. 155-164.
ISBN978-0-7695-4510-3
Versió de l'editorhttp://dl.acm.org/citation.cfm?id=2066302.2066946
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
Unveiling Inter ... on Computation Phases .pdf | Unveiling Internal Evolution of Parallel Application Computation Phases .pdf | 1,292Mb | Accés restringit |