AMA: asynchronous management of accelerators for task-based programming models
dc.contributor.author | Planas, Judit |
dc.contributor.author | Badia Sala, Rosa Maria |
dc.contributor.author | Ayguadé Parra, Eduard |
dc.contributor.author | Labarta Mancho, Jesús José |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors |
dc.date.accessioned | 2017-05-10T11:30:07Z |
dc.date.available | 2017-05-10T11:30:07Z |
dc.date.issued | 2015 |
dc.identifier.citation | Planas, J., Badia, R.M., Ayguadé, E., Labarta, J. AMA: asynchronous management of accelerators for task-based programming models. In: International Conference on Computational Science. "Procedia Computer Science (Vol. 51, 2015)". Reykjavík: Elsevier, 2015, p. 130-139. |
dc.identifier.isbn | 1877-0509 |
dc.identifier.uri | http://hdl.handle.net/2117/104266 |
dc.description.abstract | Computational science has benefited in recent years from emerging accelerators that increase the performance of scientific simulations, but using these devices complicates programming. This paper presents AMA: a set of optimization techniques to efficiently manage multi-accelerator systems. AMA maximizes the overlap of computation and communication in a blocking-free way, so the time otherwise spent waiting for device operations can be used for other work. AMA is implemented on top of a task-based framework, and its experimental evaluation on a quad-GPU node shows that it reaches the performance of a hand-tuned native CUDA code, with the advantage of fully hiding the device management. In addition, it achieves a speed-up of up to more than 2x with respect to the original framework implementation. |
dc.format.extent | 10 p. |
dc.language.iso | eng |
dc.publisher | Elsevier |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Spain |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
dc.subject | UPC subject areas::Computer science::Computer architecture |
dc.subject.lcsh | Graphics processing units |
dc.subject.lcsh | Multiprocessors |
dc.subject.lcsh | Parallel processing (Electronic computers) |
dc.subject.other | Accelerator management |
dc.subject.other | Asynchronous devices |
dc.subject.other | Programming models |
dc.subject.other | Multi-GPU systems |
dc.title | AMA: asynchronous management of accelerators for task-based programming models |
dc.type | Conference report |
dc.subject.lemac | Multiprocessadors |
dc.subject.lemac | Processament en paral·lel (Ordinadors) |
dc.contributor.group | Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions |
dc.identifier.doi | 10.1016/j.procs.2015.05.212 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | http://www.sciencedirect.com/science/article/pii/S1877050915010200 |
dc.rights.access | Open Access |
local.identifier.drac | 19376613 |
dc.description.version | Postprint (published version) |
local.citation.author | Planas, J.; Badia, R.M.; Ayguadé, E.; Labarta, J. |
local.citation.contributor | International Conference on Computational Science |
local.citation.pubplace | Reykjavík |
local.citation.publicationName | Procedia Computer Science (Vol. 51, 2015) |
local.citation.startingPage | 130 |
local.citation.endingPage | 139 |