Ara es mostren els items 1-20 de 35

    • A library implementation of the nano-threads programming model 

      Martorell Bofill, Xavier; Labarta Mancho, Jesús José; Navarro, Nacho; Ayguadé Parra, Eduard (Springer, 1996)
      Text en actes de congrés
      Accés obert
      In this paper we describe the design and implementation of a user-level thread package based on the nano-threads programming model, whose goal is to efficiently manage the application parallelism at user-level. Nano-thread ...
    • Adaptive runtime-assisted block prefetching on chip-multiprocessors 

      García Flores, Víctor; Rico Carro, Alejandro; Villavieja Prados, Carlos; Carpenter, Paul Matthew; Navarro, Nacho; Ramirez, Alex (2016-04-29)
      Article
      Accés obert
      Memory stalls are a significant source of performance degradation in modern processors. Data prefetching is a widely adopted and well studied technique used to alleviate this problem. Prefetching can be performed by the ...
    • Adjust of solar cell model parameters based in MATLAB 

      Silvestre Bergés, Santiago; Guasch Murillo, Daniel; Ortega Villasclaras, Pablo Rafael; Navarro, Nacho (M. C. Acero, M. Lozano - CNM-CSIC, 2003)
      Comunicació de congrés
      Accés restringit per política de l'editorial
      In this work, a model parameter extractor of solar cells using Matlab is shown. The software, developed in Matlab environment, has been successfully applied to study internal characteristics of photovoltaic devices [1-2], ...
    • An asymmetric distributed shared memory model for heterogeneous parallel systems 

      Gelado Fernandez, Isaac; Cabezas, Javier; Navarro, Nacho; Stone, John E.; Patel, Sanjay; Hwu, Wen-mei W. (2010)
      Text en actes de congrés
      Accés obert
      Heterogeneous computing combines general purpose CPUs with accelerators to efficiently execute both sequential control-intensive and data-parallel phases of applications. Existing programming models for heterogeneous ...
    • An in-operation planning tool architecture for flexgrid network re-optimization 

      Gifre Renom, Lluís; Castro Casales, Alberto; Ruiz Ramírez, Marc; Navarro, Nacho; Velasco Esteban, Luis Domingo (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      In-operation network planning consists in re-optimizing a network, currently being operated to transport traffic, either minimizing resource utilization or maximizing the transported traffic. In the context of flexgrid ...
    • An open benchmark implementation for multi-CPU multi-GPU pedestrian detection in automotive systems 

      Trompouki, Matina M.; Kosmidis, Leonidas; Navarro, Nacho (IEEE, 2017-12-14)
      Comunicació de congrés
      Accés obert
      Modern and future automotive systems incorporate several Advanced Driving Assistance Systems (ADAS). Those systems require significant performance that cannot be provided with traditional automotive processors and programming ...
    • Analyzing performance improvements and energy savings in Infiniband architecture using network compression 

      Dickov, Branimir; Pericas, Miquel; Carpenter, Paul Matthew; Navarro, Nacho; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      One of the greatest challenges in HPC is total system power and energy consumption. Whereas HPC interconnects have traditionally been designed with a focus on bandwidth and latency, there is an increasing interest in ...
    • Applying interposition techniques for performance analysis of OPENMP parallel applications 

      González Tallada, Marc; Serra, Albert; Martorell Bofill, Xavier; Oliver Segura, José; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Navarro, Nacho (Institute of Electrical and Electronics Engineers (IEEE), 2000)
      Text en actes de congrés
      Accés obert
      Tuning parallel applications requires the use of effective tools for detecting performance bottlenecks. Along a parallel program execution, many individual situations of performance degradation may arise. We believe that ...
    • Architecture of a specialized back-end high performance computing-based PCE for flexgrid networks 

      Gifre Renom, Lluís; Velasco Esteban, Luis Domingo; Navarro, Nacho (Institute of Electrical and Electronics Engineers (IEEE), 2013)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      The requirement of executing network re-optimization operations to efficiently manage and deploy new generation flexgrid-based optical networks has brought to light the need of some specialized PCEs capable of performing ...
    • CODOMs: Protecting software with code-centric memory domains 

      Vilanova, Lluís; Ben-Yehuda, Muli; Navarro, Nacho; Etsion, Yoav; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Today's complex software systems are neither secure nor reliable. The rudimentary software protection primitives provided by current hardware forces systems to run many distrusting software components (e.g., procedures, ...
    • Coherence protocol for transparent management of scratchpad memories in shared memory manycore architectures 

      Álvarez Martí, Lluc; Vilanova, Lluís; Moretó Planas, Miquel; Casas, Marc; González Tallada, Marc; Martorell Bofill, Xavier; Navarro, Nacho; Ayguadé Parra, Eduard; Valero Cortés, Mateo (Association for Computing Machinery (ACM), 2015)
      Text en actes de congrés
      Accés obert
      The increasing number of cores in manycore architectures causes important power and scalability problems in the memory subsystem. One solution is to introduce scratchpad memories alongside the cache hierarchy, forming a ...
    • Comparison based sorting for systems with multiple GPUs 

      Tanasic, Ivan; Vilanova, Lluís; Jorda, Marc; Cabezas, Javier; Gelado Fernandez, Isaac; Navarro, Nacho; Hwu, Wen-mei W. (Association for Computing Machinery (ACM), 2013)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      As a basic building block of many applications, sorting algorithms that efficiently run on modern machines are key for the performance of these applications. With the recent shift to using GPUs for general purpose compuing, ...
    • CUsched: multiprogrammed workload scheduling on GPU architectures 

      Tanasic, Ivan; Gelado Fernandez, Isaac; Cabezas, Javier; Navarro, Nacho; Ramírez Bellido, Alejandro; Valero Cortés, Mateo (2013)
      Report de recerca
      Accés obert
      Graphic Processing Units (GPUs) are currently widely used in High Performance Computing (HPC) applications to speed-up the execution of massively-parallel codes. GPUs are well-suited for such HPC environments because ...
    • Data stream classification using random feature functions and novel method combinations 

      Marrón Vida, Diego; Read, Jesse; Bifet Figuerol, Albert Carles; Navarro, Nacho (2017-05-01)
      Article
      Accés obert
      Big Data streams are being generated in a faster, bigger, and more commonplace. In this scenario, Hoeffding Trees are an established method for classification. Several extensions exist, including high performing ensemble ...
    • Design space explorations for streaming accelerators using streaming architectural simulator 

      Shafiq, Muhammad; Pericas, Miquel; Navarro, Nacho; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2013)
      Text en actes de congrés
      Accés obert
      In the recent years streaming accelerators like GPUs have been pop-up as an effective step towards parallel computing. The wish-list for these devices span from having a support for thousands of small cores to a nature ...
    • Direct Inter-Process Communication (dIPC): Repurposing the CODOMs architecture to accelerate IPC 

      Vilanova, Lluis; Jordà Peroliu, Marc; Navarro, Nacho; Etsion, Yoav; Valero Cortés, Mateo (Association for Computing Machinery (ACM), 2017)
      Text en actes de congrés
      Accés obert
      In current architectures, page tables are the fundamental mechanism that allows contemporary OSs to isolate user processes, binding each thread to a specific page table. A thread cannot therefore directly call another ...
    • Efficient exception handling support for GPUs 

      Tanasic, Ivan; Gelado Fernandez, Isaac; Jorda, Marc; Ayguadé Parra, Eduard; Navarro, Nacho (Association for Computing Machinery (ACM), 2017)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Operating systems have long relied on the exception handling mechanism to implement numerous virtual memory features and optimizations. However, today's GPUs have a limited support for exceptions, which prevents implementation ...
    • Enabling preemptive multiprogramming on GPUs 

      Tanasic, Ivan; Gelado Fernandez, Isaac; Cabezas, Javier; Ramírez Bellido, Alejandro; Navarro, Nacho; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Text en actes de congrés
      Accés obert
      GPUs are being increasingly adopted as compute accelerators in many domains, spanning environments from mobile systems to cloud computing. These systems are usually running multiple applications, from one or several users. ...
    • Experimental assessment of a high performance back-end PCE for Flexgrid optical network re-optimization 

      Gifre Renom, Lluís; Velasco Esteban, Luis Domingo; Navarro, Nacho; Junyent Giralt, Gabriel (The Optical Society (OSA), 2014)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      A specialized high performance Graphics Processing Unit (GPU)-based back-end Path Computation Element (PCE) to compute re-optimization in Flexgrid networks is presented. Experimental results show 6x speedups compared to ...
    • Hardware-software coherence protocol for the coexistence of caches and local memories 

      Álvarez Martí, Lluc; Vilanova, Lluís; González Tallada, Marc; Martorell Bofill, Xavier; Navarro, Nacho; Ayguadé Parra, Eduard (2015-01-01)
      Article
      Accés obert
      Cache coherence protocols limit the scalability of multicore and manycore architectures and are responsible for an important amount of the power consumed in the chip. A good way to alleviate these problems is to introduce ...