• Aging-aware parallel execution 

      Medeiros, Thiarles S.; Berned, Gustavo P.; Navarro Muñoz, Antoni; Rossi, Fábio D.; Luizelli, Marcelo C.; Brandalero, Marcelo; Hübner, Michael; Beck, Antonio Carlos S.; Lorenzon, Arthur F. (Institute of Electrical and Electronics Engineers (IEEE), 2021-09)
      Article
      Accés obert
      Computation has been pushed to the edge to decrease latency and alleviate the computational burden of the IoT applications in the cloud. However, the increasing processing demands of Edge Applications make necessary the ...
    • Combining dynamic concurrency throttling with voltage and frequency scaling on task-based programming models 

      Navarro Muñoz, Antoni; Lorenzon, Arthur F.; Ayguadé Parra, Eduard; Beltran Querol, Vicenç (Association for Computing Machinery (ACM), 2021)
      Text en actes de congrés
      Accés obert
      Being on the verge of exascale performance has shifted the prioritization of performance in applications to the inclusion of power-performance efficiency as a primary objective in the High Performance Computing (HPC) ...
    • Enhancing resource management through prediction-based policies 

      Navarro Muñoz, Antoni; Lorenzon, Arthur F.; Ayguadé Parra, Eduard; Beltran Querol, Vicenç (Springer, 2020)
      Text en actes de congrés
      Accés obert
      Task-based programming models are emerging as a promising alternative to make the most of multi-/many-core systems. These programming models rely on runtime systems, and their goal is to improve application performance by ...
    • Mitigating the NUMA effect on task-based runtime systems 

      Maroñas Bravo, Marcos; Navarro Muñoz, Antoni; Ayguadé Parra, Eduard; Beltran Querol, Vicenç (Springer Nature, 2023-09)
      Article
      Accés obert
      Processors with multiple sockets or chiplets are becoming more conventional. These kinds of processors usually expose a single shared address space. However, due to hardware restrictions, they adopt a NUMA approach, where ...
    • Seamless optimization of the GEMM kernel for task-based programming models 

      Lorenzon, Arthur F.; Marques, Sandro M. V. N.; Navarro Muñoz, Antoni; Beltran Querol, Vicenç (Association for Computing Machinery (ACM), 2022)
      Text en actes de congrés
      Accés obert
      The general matrix-matrix multiplication (GEMM) kernel is a fundamental building block of many scientific applications. Many libraries such as Intel MKL and BLIS provide highly optimized sequential and parallel versions ...