Exploració per autor "Peña, Antonio"
Ara es mostren els items 1-11 de 11
-
Analysis of threading libraries for high performance computing
Castello, Adrián; Mayo, Rafael; Sangmin, Seo; Pavan, Balaji; Quintana Ortí, Enrique Salvador; Peña, Antonio (IEEE, 2020)
Article
Accés obertWith the appearance of multi-many core machines, applications and runtime systems evolved in order to exploit the new on-node concurrency that brought new software paradigms. POSIX threads (Pthreads) was widely-adopted for ... -
DMR API: improving cluster productivity by turning applications into malleable
Iserte, Sergio; Mayo, Rafael; Quintana Ortí, Enrique Salvador; Beltran, vicenç; Peña, Antonio (Elsevier, 2018-10)
Article
Accés obertAdaptive workloads can change on–the–fly the configuration of their jobs, in terms of number of processes. To carry out these job reconfigurations, we have designed a methodology which enables a job to communicate with the ... -
DMRlib: Easy-coding and efficient resource management for job malleability
Iserte, Sergio; Mayo, Rafael; Quintana Ortí, Enrique Salvador; Peña, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2020)
Article
Accés obertProcess malleability has proved to have a highly positive impact on the resource utilization and global productivity in data centers compared with the conventional static resource allocation policy. However, the non-negligible ... -
DPU Offloading Programming with the OpenMP API
Usman, Muhammad; Iserte, Sergio; Ferrer Ibañez, Roger; Peña, Antonio (Association for Computing Machinery (ACM), 2023-11)
Comunicació de congrés
Accés obertData processing units (DPUs) as network co-processors are an emerging trend in our community, with plenty of opportunities yet to be explored. These have been generally used as domain-specific accelerators transparent to ... -
Dynamic reconfiguration of noniterative scientific applications A case study with HPG aligner
Iserte, Sergio; Martínez Pérez, Héctor; Barrachina Mir, Sergio; Castillo Catalán, Maribel; Mayo, Rafael; Peña, Antonio (SAGE Publications, 2018)
Article
Accés obertSeveral studies have proved the benefits of job malleability, that is, the capacity of an application to adapt its parallelism to a dynamically changing number of allocated processors. The most remarkable advantages of ... -
Enabling homomorphically encrypted inference for large DNN models
Lloret Talavera, Guillermo; Jorda, Marc; Servat, Harald; Boemer, Fabian; Chauhan, Chetan; Tomishima, Shigeki; Shah, Nilesh N.; Peña, Antonio (Institute of Electrical and Electronics Engineers, 2021)
Article
Accés obertThe proliferation of machine learning services in the last few years has raised data privacy concerns. Homomorphic encryption (HE) enables inference using encrypted data but it incurs 100x-10,000x memory and runtime ... -
OmpSs-2 and OpenACC interoperation
Korakitis, Orestis; Garcia de Gonzalo, Simon; Guidotti, Nicolas; Barreto, João; Monteiro, José; Peña, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2023)
Comunicació de congrés
Accés obertWe propose an interoperation mechanism to enable novel composability across pragma-based programming models. We study and propose a clear separation of duties and implement our approach by augmenting the OmpSs-2 programming ... -
Predicate-based filtering for multi-GPU utilization in directive-based programming
Matsumura, Kazuaki; García De Gonzalo, Simón; Peña, Antonio (Barcelona Supercomputing Center, 2021-05)
Text en actes de congrés
Accés obertDesigning and building supercomputers is a complex task in the field of high-performance computing (HPC). The hardware, middleware and algorithms need to effectively collaborate to achieve ideal results for massive and ... -
Static Graphs for Coding Productivity in OpenACC
Toledo, Leonel; Valero Lara, Pedro; Vetter, Jeffrey; Peña, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2022)
Comunicació de congrés
Accés obertThe main contribution of this work is to increase the coding productivity for GPU programming by using the concept of Static Graphs. To do so, we have combined the new CUDA Graph API with the OpenACC programming model. We ... -
Towards enhancing coding productivity for GPU programming using static graphs
Toledo, Leonel; Valero Lara, Pedro; Vetter, Jeffrey S.; Peña, Antonio (MDPI, 2022)
Article
Accés obertThe main contribution of this work is to increase the coding productivity of GPU programming by using the concept of Static Graphs. GPU capabilities have been increasing significantly in terms of performance and memory ... -
Towards OmpSs-2 and OpenACC interoperation
Korakitis, Orestis; García De Gonzalo, Simón; Guidotti, Nicolas; Barreto, João Pedro; Monteiro, José C.; Peña, Antonio (Association for Computing Machinery, 2022)
Comunicació de congrés
Accés obertThe increasing demand in HPC to utilize accelerators has motivated the development of pragma-based directives to target these devices. OmpSs-2 and OpenACC are both directive-based solutions that allow application programmers ...