Now showing items 1-5 of 5

    • An OpenMP free agent threads implementation 

      López, Victor; Criado Ledesma, Joel; Peñacoba Veigas, Raúl; Ferrer Ibañez, Roger; Teruel García, Xavier; Garcia Gasulla, Marta (Springer, 2021)
      Conference report
      Open Access
      In this paper, we introduce a design and implementation of the free agent threads for OpenMP. These threads increase the malleability of the OpenMP programming model, offering resource managers and runtime systems flexibility ...
    • DROM: Enabling Efficient and Effortless Malleability for Resource Managers 

      D'Amico, Marco; Garcia Gasulla, Marta; López, Victor; Jokanovic, Ana; Sirvent, Raül; Corbalán González, Julita (Association for Computing Machinery (ACM), 2018-08-13)
      Conference lecture
      Open Access
      In the design of future HPC systems, research in resource management is showing an increasing interest in a more dynamic control of the available resources. It has been proven that enabling the jobs to change the number ...
    • MPI+X: task-based parallelisation and dynamic load balance of finite element assembly 

      Garcia, Marta; Houzeaux, Guillaume; Ferrer, Roger; Artigues, Antoni; López, Victor; Labarta Mancho, Jesús José; Vázquez, Mariano (Taylor & Francis, 2019-05)
      Article
      Open Access
      The main computing phases of numerical methods for solving partial differential equations are the algebraic system assembly and the iterative solver. This work focuses on the first task, in the context of a hybrid MPI+X ...
    • MPI+X: task-based parallelization and dynamic load balance of finite element assembly 

      Garcia-Gasulla, Marta; Houzeaux, Guillaume; Ferrer, Roger; Artigues, Antoni; López, Victor; Labarta Mancho, Jesús José; Vázquez, Mariano (Taylor & Francis, 2018)
      Article
      Open Access
      The main computing tasks of a finite element code(FE) for solving partial differential equations (PDE's) are the algebraic system assembly and the iterative solver. This work focuses on the first task, in the context of ...
    • Performance analysis and optimization of the FFTXlib on the Intel knights landing architecture 

      Wagner, Michael; López, Victor; Morillo, Julian; Cavazzoni, Carlo; Affinito, Fabio; Gimenez, Judit; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2017)
      Conference report
      Open Access
      In this paper, we address the decreasing performance of the FFTXlib, the Fast Fourier Transformation (FFT) kernel of Quantum ESPRESSO, when scaling to a full KNL node. An increased performance in the FFTXlib will likewise ...