    • An asymmetric distributed shared memory model for heterogeneous parallel systems 

      Gelado Fernandez, Isaac; Cabezas, Javier; Navarro, Nacho; Stone, John E.; Patel, Sanjay; Hwu, Wen-mei W. (2010)
      Conference report
      Open Access
      Heterogeneous computing combines general purpose CPUs with accelerators to efficiently execute both sequential control-intensive and data-parallel phases of applications. Existing programming models for heterogeneous ...
    • Comparison based sorting for systems with multiple GPUs 

      Tanasic, Ivan; Vilanova, Lluís; Jorda, Marc; Cabezas, Javier; Gelado Fernandez, Isaac; Navarro, Nacho; Hwu, Wen-mei W. (Association for Computing Machinery (ACM), 2013)
      Conference report
      Restricted access - publisher's policy
      As a basic building block of many applications, sorting algorithms that efficiently run on modern machines are key for the performance of these applications. With the recent shift to using GPUs for general purpose computing, ...
    • CUsched: multiprogrammed workload scheduling on GPU architectures 

      Tanasic, Ivan; Gelado Fernandez, Isaac; Cabezas, Javier; Navarro, Nacho; Ramírez Bellido, Alejandro; Valero Cortés, Mateo (2013)
      External research report
      Open Access
      Graphics Processing Units (GPUs) are currently widely used in High Performance Computing (HPC) applications to speed up the execution of massively-parallel codes. GPUs are well-suited for such HPC environments because ...
    • Efficient exception handling support for GPUs 

      Tanasic, Ivan; Gelado Fernandez, Isaac; Jorda, Marc; Ayguadé Parra, Eduard; Navarro, Nacho (Association for Computing Machinery (ACM), 2017)
      Conference report
      Restricted access - publisher's policy
      Operating systems have long relied on the exception handling mechanism to implement numerous virtual memory features and optimizations. However, today's GPUs have limited support for exceptions, which prevents implementation ...
    • Enabling preemptive multiprogramming on GPUs 

      Tanasic, Ivan; Gelado Fernandez, Isaac; Cabezas, Javier; Ramírez Bellido, Alejandro; Navarro, Nacho; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Conference report
      Open Access
      GPUs are being increasingly adopted as compute accelerators in many domains, spanning environments from mobile systems to cloud computing. These systems are usually running multiple applications, from one or several users. ...
    • Energy efficient HPC on embedded SoCs: optimization techniques for Mali GPU 

      Grasso, Ivan; Radojkovic, Petar; Rajovic, Nikola; Gelado Fernandez, Isaac; Ramírez Bellido, Alejandro (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Conference report
      Restricted access - publisher's policy
      A lot of effort from academia and industry has been invested in exploring the suitability of low-power embedded technologies for HPC. Although state-of-the-art embedded systems-on-chip (SoCs) inherently contain GPUs that ...
    • Experiences with mobile processors for energy efficient HPC 

      Rajovic, Nikola; Rico Carro, Alejandro; Vipond, James; Gelado Fernandez, Isaac; Puzovic, Nikola; Ramírez Bellido, Alejandro (2013)
      Conference report
      Restricted access - publisher's policy
      The performance of High Performance Computing (HPC) systems is already limited by their power consumption. The majority of top HPC systems today are built from commodity server components that were designed for maximizing ...
    • High-performance reverse time migration on GPU 

      Cabezas, Javier; Ayala Polo, Mauricio; Gelado Fernandez, Isaac; Morancho Llena, Enrique; Navarro, Nacho; Cela Espín, José M. (2009-11)
      Conference report
      Open Access
      Partial Differential Equations (PDE) are the heart of most simulations in many scientific fields, from Fluid Mechanics to Astrophysics. One of the most popular mathematical schemes to solve a PDE is Finite Difference (FD). In ...
    • On-Chip memories, the OS perspective 

      Villavieja Prados, Carlos; Gelado Fernandez, Isaac; Ramírez Bellido, Alejandro; Navarro, Nacho (2008-06-04)
      Conference report
      Open Access
      This paper is a work-in-progress study of the operating system services required to manage on-chip memories. We are evaluating different CMP on-chip memory configurations. Chip-MultiProcessor (CMP) architectures ...
    • Parallelizing general histogram application for CUDA architectures 

      Milic, Ugljesa; Gelado Fernandez, Isaac; Puzovic, Nikola; Ramírez Bellido, Alejandro; Tomasevic, Milo (IEEE Computational Intelligence Society, 2013)
      Conference report
      Restricted access - publisher's policy
      Histogramming is a tool commonly used in data analysis. Although its serial version is simple to implement, providing an efficient and scalable way to parallelize it can be challenging. This especially holds in case of ...
    • Predictive runtime code scheduling for heterogeneous architectures 

      Jimenez, Victor; Vilanova, Lluis; Gelado Fernandez, Isaac; Gil, Marisa; Fursin, Gregori; Navarro, Nacho (2009)
      Conference report
      Open Access
      Heterogeneous architectures are currently widespread. With the advent of easy-to-program general purpose GPUs, virtually every recent desktop computer is a heterogeneous system. Combining the CPU and the GPU brings ...
    • Supercomputing with commodity CPUs: are mobile SoCs ready for HPC? 

      Rajovic, Nikola; Carpenter, Paul; Gelado Fernandez, Isaac; Puzovic, Nikola; Ramírez Bellido, Alejandro; Valero Cortés, Mateo (Association for Computing Machinery (ACM), 2013)
      Conference report
      Restricted access - publisher's policy
      In the late 1990s, powerful economic forces led to the adoption of commodity desktop processors in high-performance computing. This transformation has been so effective that the June 2013 TOP500 list is still dominated by ...