Now showing items 1-8 of 8

    • Analysis of threading libraries for high performance computing 

      Castello, Adrián; Mayo, Rafael; Sangmin, Seo; Pavan, Balaji; Quintana-Ortí, Enrique S.; Peña, Antonio (IEEE, 2020)
      Article
      Open Access
      With the appearance of multi-many core machines, applications and runtime systems evolved in order to exploit the new on-node concurrency that brought new software paradigms. POSIX threads (Pthreads) was widely-adopted for ...
    • DMR API: improving cluster productivity by turning applications into malleable 

      Iserte, Sergio; Mayo, Rafael; Quintana-Ortí, Enrique S.; Beltran, vicenç; Peña, Antonio (Elsevier, 2018-10)
      Article
      Open Access
      Adaptive workloads can change on–the–fly the configuration of their jobs, in terms of number of processes. To carry out these job reconfigurations, we have designed a methodology which enables a job to communicate with the ...
    • DMRlib: Easy-coding and efficient resource management for job malleability 

      Iserte, Sergio; Mayo, Rafael; Quintana-Orti, Enrique S.; Peña, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2020)
      Article
      Open Access
      Process malleability has proved to have a highly positive impact on the resource utilization and global productivity in data centers compared with the conventional static resource allocation policy. However, the non-negligible ...
    • Dynamic reconfiguration of noniterative scientific applications A case study with HPG aligner 

      Iserte, Sergio; Martínez Pérez, Héctor; Barrachina Mir, Sergio; Castillo Catalán, Maribel; Mayo, Rafael; Peña, Antonio (SAGE Publications, 2018)
      Article
      Open Access
      Several studies have proved the benefits of job malleability, that is, the capacity of an application to adapt its parallelism to a dynamically changing number of allocated processors. The most remarkable advantages of ...
    • Enabling homomorphically encrypted inference for large DNN models 

      Lloret Talavera, Guillermo; Jorda, Marc; Servat, Harald; Boemer, Fabian; Chauhan, Chetan; Tomishima, Shigeki; Shah, Nilesh N.; Peña, Antonio (Institute of Electrical and Electronics Engineers, 2021)
      Article
      Open Access
      The proliferation of machine learning services in the last few years has raised data privacy concerns. Homomorphic encryption (HE) enables inference using encrypted data but it incurs 100x-10,000x memory and runtime ...
    • Predicate-based filtering for multi-GPU utilization in directive-based programming 

      Matsumura, Kazuaki; García De Gonzalo, Simón; Peña, Antonio (Barcelona Supercomputing Center, 2021-05)
      Conference report
      Open Access
      Designing and building supercomputers is a complex task in the field of high-performance computing (HPC). The hardware, middleware and algorithms need to effectively collaborate to achieve ideal results for massive and ...
    • Static Graphs for Coding Productivity in OpenACC 

      Toledo, Leonel; Valero Lara, Pedro; Vetter, Jeffrey; Peña, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2022)
      Conference lecture
      Open Access
      The main contribution of this work is to increase the coding productivity for GPU programming by using the concept of Static Graphs. To do so, we have combined the new CUDA Graph API with the OpenACC programming model. We ...
    • Towards enhancing coding productivity for GPU programming using static graphs 

      Toledo, Leonel; Valero Lara, Pedro; Vetter, Jeffrey S.; Peña, Antonio (MDPI, 2022)
      Article
      Open Access
      The main contribution of this work is to increase the coding productivity of GPU programming by using the concept of Static Graphs. GPU capabilities have been increasing significantly in terms of performance and memory ...