Ara es mostren els items 1-11 de 11

    • Analysis of threading libraries for high performance computing 

      Castello, Adrián; Mayo, Rafael; Sangmin, Seo; Pavan, Balaji; Quintana Ortí, Enrique Salvador; Peña, Antonio (IEEE, 2020)
      Article
      Accés obert
      With the appearance of multi-many core machines, applications and runtime systems evolved in order to exploit the new on-node concurrency that brought new software paradigms. POSIX threads (Pthreads) was widely-adopted for ...
    • DMR API: improving cluster productivity by turning applications into malleable 

      Iserte, Sergio; Mayo, Rafael; Quintana Ortí, Enrique Salvador; Beltran, vicenç; Peña, Antonio (Elsevier, 2018-10)
      Article
      Accés obert
      Adaptive workloads can change on–the–fly the configuration of their jobs, in terms of number of processes. To carry out these job reconfigurations, we have designed a methodology which enables a job to communicate with the ...
    • DMRlib: Easy-coding and efficient resource management for job malleability 

      Iserte, Sergio; Mayo, Rafael; Quintana Ortí, Enrique Salvador; Peña, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2020)
      Article
      Accés obert
      Process malleability has proved to have a highly positive impact on the resource utilization and global productivity in data centers compared with the conventional static resource allocation policy. However, the non-negligible ...
    • DPU Offloading Programming with the OpenMP API 

      Usman, Muhammad; Iserte, Sergio; Ferrer Ibañez, Roger; Peña, Antonio (Association for Computing Machinery (ACM), 2023-11)
      Comunicació de congrés
      Accés obert
      Data processing units (DPUs) as network co-processors are an emerging trend in our community, with plenty of opportunities yet to be explored. These have been generally used as domain-specific accelerators transparent to ...
    • Dynamic reconfiguration of noniterative scientific applications A case study with HPG aligner 

      Iserte, Sergio; Martínez Pérez, Héctor; Barrachina Mir, Sergio; Castillo Catalán, Maribel; Mayo, Rafael; Peña, Antonio (SAGE Publications, 2018)
      Article
      Accés obert
      Several studies have proved the benefits of job malleability, that is, the capacity of an application to adapt its parallelism to a dynamically changing number of allocated processors. The most remarkable advantages of ...
    • Enabling homomorphically encrypted inference for large DNN models 

      Lloret Talavera, Guillermo; Jorda, Marc; Servat, Harald; Boemer, Fabian; Chauhan, Chetan; Tomishima, Shigeki; Shah, Nilesh N.; Peña, Antonio (Institute of Electrical and Electronics Engineers, 2021)
      Article
      Accés obert
      The proliferation of machine learning services in the last few years has raised data privacy concerns. Homomorphic encryption (HE) enables inference using encrypted data but it incurs 100x-10,000x memory and runtime ...
    • OmpSs-2 and OpenACC interoperation 

      Korakitis, Orestis; Garcia de Gonzalo, Simon; Guidotti, Nicolas; Barreto, João; Monteiro, José; Peña, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2023)
      Comunicació de congrés
      Accés obert
      We propose an interoperation mechanism to enable novel composability across pragma-based programming models. We study and propose a clear separation of duties and implement our approach by augmenting the OmpSs-2 programming ...
    • Predicate-based filtering for multi-GPU utilization in directive-based programming 

      Matsumura, Kazuaki; García De Gonzalo, Simón; Peña, Antonio (Barcelona Supercomputing Center, 2021-05)
      Text en actes de congrés
      Accés obert
      Designing and building supercomputers is a complex task in the field of high-performance computing (HPC). The hardware, middleware and algorithms need to effectively collaborate to achieve ideal results for massive and ...
    • Static Graphs for Coding Productivity in OpenACC 

      Toledo, Leonel; Valero Lara, Pedro; Vetter, Jeffrey; Peña, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2022)
      Comunicació de congrés
      Accés obert
      The main contribution of this work is to increase the coding productivity for GPU programming by using the concept of Static Graphs. To do so, we have combined the new CUDA Graph API with the OpenACC programming model. We ...
    • Towards enhancing coding productivity for GPU programming using static graphs 

      Toledo, Leonel; Valero Lara, Pedro; Vetter, Jeffrey S.; Peña, Antonio (MDPI, 2022)
      Article
      Accés obert
      The main contribution of this work is to increase the coding productivity of GPU programming by using the concept of Static Graphs. GPU capabilities have been increasing significantly in terms of performance and memory ...
    • Towards OmpSs-2 and OpenACC interoperation 

      Korakitis, Orestis; García De Gonzalo, Simón; Guidotti, Nicolas; Barreto, João Pedro; Monteiro, José C.; Peña, Antonio (Association for Computing Machinery, 2022)
      Comunicació de congrés
      Accés obert
      The increasing demand in HPC to utilize accelerators has motivated the development of pragma-based directives to target these devices. OmpSs-2 and OpenACC are both directive-based solutions that allow application programmers ...