
Universitat Politècnica de Catalunya


UPCommons. Global access to UPC knowledge


The secrets of the accelerators unveiled: tracing heterogeneous executions through OMPT

Llort, German
Filgueras Izquierdo, Antonio
Jiménez-González, Daniel
Servat, Harald
Teruel, Xavier
Mercadal, Estanislao
Álvarez, Carlos
Giménez, Judit
Martorell Bofill, Xavier
Ayguadé Parra, Eduard
Labarta Mancho, Jesús José
Document type: Conference report
Publication date: 2016
Publisher: Springer
Rights access: Restricted access - publisher's policy
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder.
Project: AXIOM - Agile, eXtensible, fast I/O Module for the cyber-physical era (EC-H2020-645496)
COMPUTACION DE ALTAS PRESTACIONES VII (MINECO-TIN2015-65316-P)
Abstract
Heterogeneous systems are an important trend in the future of supercomputers, yet they can be hard to program and developers still lack powerful tools to gain understanding about how well their accelerated codes perform and how to improve them. Having different types of hardware accelerators available, each with their own specific low-level APIs to program them, there is not yet a clear consensus on a standard way to retrieve information about the accelerator's performance. To improve this scenario, OMPT is a novel performance monitoring interface that is being considered for integration into the OpenMP standard. OMPT allows analysis tools to monitor the execution of parallel OpenMP applications by providing detailed information about the activity of the runtime through a standard API. For accelerated devices, OMPT also facilitates the exchange of performance information between the runtime and the analysis tool. We implement part of the OMPT specification that refers to the use of accelerators both in the Nanos++ parallel runtime system and the Extrae tracing framework, obtaining detailed performance information about the execution of the tasks issued to the accelerated devices to later conduct insightful analysis. Our work extends previous efforts in the field to expose detailed information from the OpenMP and OmpSs runtimes, regarding the activity and performance of task-based parallel applications. In this paper, we focus on the evaluation of FPGA devices studying the performance of two common kernels in scientific algorithms: matrix multiplication and Cholesky decomposition. Furthermore, this development is seamlessly applicable for the analysis of GPGPU accelerators and Intel® Xeon Phi™ co-processors operating under the OmpSs programming model.
Citation: Llort, G., Filgueras, A., Jiménez-González, D., Servat, H., Teruel, X., Mercadal, E., Álvarez, C., Giménez, J., Martorell, X., Ayguade, E., Labarta, J. The secrets of the accelerators unveiled: tracing heterogeneous executions through OMPT. In: International Workshop on OpenMP. "OpenMP: memory, devices, and tasks: 12th International Workshop on OpenMP: IWOMP 2016: Nara, Japan: October 5-7, 2016: proceedings". Nara: Springer, 2016, p. 217-236.
URI: http://hdl.handle.net/2117/91298
DOI: 10.1007/978-3-319-45550-1_16
ISBN: 978-3-319-45549-5
Publisher version: http://link.springer.com/chapter/10.1007%2F978-3-319-45550-1_16
Collections
  • Computer Sciences - Capítols de llibre [21]
  • CAP - Grup de Computació d'Altes Prestacions - Ponències/Comunicacions de congressos [762]
  • Departament d'Arquitectura de Computadors - Ponències/Comunicacions de congressos [1.773]

Files
The secrets of the accelerators unveiled.pdf (PDF, 1.039 MB, restricted access)
