    • A case for malleable thread-level linear algebra libraries: The LU factorization with partial pivoting 

      Catalán Pallarés, Sandra; Herrero Zaragoza, José Ramón; Quintana Ortí, Enrique Salvador; Rodríguez Sánchez, Rafael; Van De Geijn, Robert (Institute of Electrical and Electronics Engineers (IEEE), 2019-01-31)
      Article
      Open Access
      We propose two novel techniques for overcoming load-imbalance encountered when implementing so-called look-ahead mechanisms in relevant dense matrix factorizations for the solution of linear systems. Both techniques target ...
    • BLAS-3 optimized by OmpSs regions (LASs library) 

      Valero Lara, Pedro; Catalán Pallarés, Sandra; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2019)
      Conference report
      Open Access
      In this paper we propose a set of optimizations for the BLAS-3 routines of LASs library (Linear Algebra routines on OmpSs) and perform a detailed analysis of the impact of the proposed changes in terms of performance and ...
    • Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD 

      Rodríguez Sánchez, Rafael; Catalán Pallarés, Sandra; Herrero Zaragoza, José Ramón; Quintana Ortí, Enrique Salvador; Tomás Domínguez, Andrés Enrique (2019-02-01)
      Article
      Open Access
      We address the reduction to compact band forms, via unitary similarity transformations, for the solution of symmetric eigenvalue problems and the computation of the singular value decomposition (SVD). Concretely, in the ...
    • sLASs: a fully automatic auto-tuned linear algebra library based on OpenMP extensions implemented in OmpSs (LASs Library) 

      Valero Lara, Pedro; Catalán Pallarés, Sandra; Martorell Bofill, Xavier; Usui, Tetsuzo; Labarta Mancho, Jesús José (Elsevier, 2020-04-01)
      Article
      Restricted access - publisher's policy
      In this work we have implemented a novel Linear Algebra Library on top of the task-based runtime OmpSs-2. We have used some of the most advanced OmpSs-2 features: weak dependencies and regions, together with the final ...
    • Static scheduling of the LU factorization with look-ahead on asymmetric multicore processors 

      Catalán Pallarés, Sandra; Herrero Zaragoza, José Ramón; Quintana Ortí, Enrique Salvador; Rodríguez Sánchez, Rafael (2018-08)
      Article
      Open Access
      We analyze the benefits of look-ahead in the parallel execution of the LU factorization with partial pivoting (LUpp) in two distinct “asymmetric” multicore scenarios. The first one corresponds to an actual hardware-asymmetric ...
    • Towards an auto-tuned and task-based SpMV (LASs Library) 

      Catalán Pallarés, Sandra; Usui, Tetsuzo; Toledo, Leonel; Martorell Bofill, Xavier; Labarta Mancho, Jesús José; Valero Lara, Pedro (Springer, 2020)
      Conference report
      Open Access
      We present a novel approach to parallelize the SpMV kernel included in LASs (Linear Algebra routines on OmpSs) library, after a deep review and analysis of several well-known approaches. LASs is based on OmpSs, a task-based ...
    • Two-sided orthogonal reductions to condensed forms on asymmetric multicore processors 

      Alonso Jordá, Pedro; Catalán Pallarés, Sandra; Herrero Zaragoza, José Ramón; Quintana Ortí, Enrique Salvador; Rodríguez Sánchez, Rafael (2018-10)
      Article
      Open Access
      We investigate how to leverage the heterogeneous resources of an Asymmetric Multicore Processor (AMP) in order to deliver high performance in the reduction to condensed forms for the solution of dense eigenvalue and ...