Now showing items 1-11 of 11

    • A square block format for symmetric band matrices 

      Gustavson, Fred G.; Herrero Zaragoza, José Ramón; Morancho Llena, Enrique (Springer, 2014)
      Conference report
      Open Access
      This contribution describes a Square Block, SB, format for storing a banded symmetric matrix. This is possible by rearranging “in place” LAPACK Band Layout to become a SB layout: store submatrices as a set of square blocks. ...
    • An automotive case study on the limits of approximation for object detection 

      Caro Roca, Martí; Tabani, Hamid; Abella Ferrer, Jaume; Moll Echeto, Francisco de Borja; Morancho Llena, Enrique; Canal Corretger, Ramon; Altet Sanahujes, Josep; Calomarde Palomino, Antonio; Cazorla Almeida, Francisco Javier; Rubio Romano, Antonio; Fontova Muste, Pau; Fornt Mas, Jordi (2023-05)
      Article
      Restricted access - publisher's policy
      The accuracy of camera-based object detection (CBOD) built upon deep learning is often evaluated against the real objects in frames only. However, such simplistic evaluation ignores the fact that many unimportant objects ...
    • Compute units in OpenMP: extensions for heterogeneous parallel programming 

      González Tallada, Marc; Morancho Llena, Enrique (John Wiley & sons, 2024-01-10)
      Article
      Open Access
      This article evaluates the current support for heterogeneous OpenMP 5.2 applications regarding the simultaneous activation of host and device computing units (e.g., CPUs, GPUs, or FPGAs). The article identifies limitations ...
    • Heterogeneous programming using OpenMP and CUDA/HIP for hybrid CPU-GPU scientific applications 

      González Tallada, Marc; Morancho Llena, Enrique (SAGE publishing, 2023-01-01)
      Article
      Open Access
      Hybrid computer systems combine compute units (CUs) of different nature like CPUs, GPUs and FPGAs. Simultaneously exploiting the computing power of these CUs requires a careful decomposition of the applications into balanced ...
    • High-performance reverse time migration on GPU 

      Cabezas, Javier; Ayala Polo, Mauricio; Gelado Fernandez, Isaac; Morancho Llena, Enrique; Navarro, Nacho; Cela Espín, José M. (2009-11)
      Conference report
      Open Access
      Partial Differential Equations (PDE) are the heart of most simulations in many scientific fields, from Fluid Mechanics to Astrophysics. One the most popular mathematical schemes to solve a PDE is Finite Difference (FD). In ...
    • Multi-GPU parallelization of the NAS multi-zone parallel benchmarks 

      González Tallada, Marc; Morancho Llena, Enrique (2021-01-01)
      Article
      Open Access
      GPU-based computing systems have become a widely accepted solution for the high-performance-computing (HPC) domain. GPUs have shown highly competitive performance-per-watt ratios and can exploit an astonishing level of ...
    • Multi-GPU systems and Unified Virtual Memory for scientific applications: The case of the NAS multi-zone parallel benchmarks 

      González Tallada, Marc; Morancho Llena, Enrique (Elsevier, 2021-12)
      Article
      Open Access
      GPU-based computing systems have become a widely accepted solution for the high-performance-computing (HPC) domain. GPUs have shown highly competitive performance-per-watt ratios and can exploit an astonishing level of ...
    • On reducing misspeculations on a pipelined scheduler 

      Gran Tejero, Ruben; Morancho Llena, Enrique; Olivé Durán, Ángel; Llaberia Griñó, José M. (2009)
      Conference report
      Open Access
      Pipelining the scheduling logic, which exposes and exploits the instruction level parallelism, degrades processor performance. In a 4-issue processor, our evaluations show that pipelining the scheduling logic over two ...
    • Solving 'Still life' with soft constraints and bucket elimination 

      Morancho Llena, Enrique; Larrosa Bondia, Francisco Javier (2003-04)
      Research report
      Open Access
      In this paper we study the aplicability of bucket elimination (BE) to the problem of finding stilllife patterns. Very recently, it has been tackled using integer programming and constraint programming, both of them ...
    • Two examples of approximate arithmetic to reduce hardware complexity and power consumption 

      Fornt Mas, Jordi; Jin, Leixin; Etxezarreta, Imanol; Fontova, Pau; Altet Sanahujes, Josep; Calomarde Palomino, Antonio; Morancho Llena, Enrique; Moll Echeto, Francisco de Borja; Rubio Sola, Jose Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2022)
      Conference lecture
      Open Access
      As the end of Moore's Law approaches, electronic system designers must find ways to keep up with the ever increasing computational demands of the modern era. Some computationally intensive applications, such as multimedia ...
    • UNIX : crides al sistema i comandes : pràctiques amb EINAM 

      Morancho Llena, Enrique (Edicions UPC, 2006)
      Book
      Restricted access to UB, UAB, UPC, UPF, UdG, UdL, URV, UOC, BC, UVic, UJI, URL, UIC users
      Aquest llibre pretén facilitar que els usuaris d’Einam (o de qualsevol altra distribució Linux o versió d’UNIX) sàpiguen interaccionar a baix nivell amb el sistema operatiu GNU/Linux. Concretament, s’explicarà el nivell ...