Now showing items 1-2 of 2

    • Tasking in accelerators: performance evaluation 

      Toledo, Leonel; Peña, Antonio J.; Catalán, Sandra; Valero-Lara, Pedro (Institute of Electrical and Electronics Engineers (IEEE), 2019)
      Conference report
      Open Access
      In this work, we analyze the implications and results of implementing dynamic parallelism, concurrent kernels and CUDA Graphs to solve task-oriented problems. As a benchmark we propose three different methods for solving ...
    • Towards an auto-tuned and task-based SpMV (LASs Library) 

      Catalán Pallarés, Sandra; Usui, Tetsuzo; Toledo, Leonel; Martorell Bofill, Xavier; Labarta Mancho, Jesús José; Valero Lara, Pedro (Springer, 2020)
      Conference report
      Open Access
      We present a novel approach to parallelize the SpMV kernel included in LASs (Linear Algebra routines on OmpSs) library, after a deep review and analysis of several well-known approaches. LASs is based on OmpSs, a task-based ...