Now showing items 1-20 of 100

    • A data-centric directive-based framework to accelerate out-of-core stencil computation on a GPU 

      Shen, Jingcheng; Ino, Fumihiko; Farrés Coma, Albert; Hanzich, Mauricio (Institute of Electronics, Information and Communication Engineers, 2020-12-01)
      Article
      Open Access
      Graphics processing units (GPUs) are highly efficient architectures for parallel stencil code; however, the small device (i.e., GPU) memory capacity (several tens of GBs) necessitates the use of out-of-core computation to ...
    • A low-power, high-performance speech recognition accelerator 

      Yazdani, Reza; Arnau Montañés, José María; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2019-12-01)
      Article
      Open Access
      Automatic Speech Recognition (ASR) is becoming increasingly ubiquitous, especially in the mobile segment. Fast and accurate ASR comes at high energy cost, not being affordable for the tiny power-budgeted mobile devices. ...
    • A methodology for selective protection of matrix multiplications: A diagnostic coverage and performance trade-off for CNNs executed on GPUs 

      Fernández Muñoz, Javier; Agirre Troncoso, Irune; Pérez Cerrolaza, Jon; Abella Ferrer, Jaume; Cazorla Almeida, Francisco Javier (Institute of Electrical and Electronics Engineers (IEEE), 2022)
      Conference report
      Open Access
      The ability of CNNs to efficiently and accurately perform complex functions, such as object detection, has fostered their adoption in safety-related autonomous systems. These algorithms require high computational performance ...
    • A Novel Set of Directives for Multi-device Programming with OpenMP 

      Torres, Raul; Ferrer, Roger; Teruel, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2022)
      Conference report
      Open Access
    • A software-only approach to enable diverse redundancy on Intel GPUs for safety-related kernels 

      Andriotis, Nikolaos; Serrano Cases, Alejandro; Alcaide Portet, Sergi; Abella Ferrer, Jaume; Cazorla Almeida, Francisco Javier; Peng, Yang; Baldovin, Andrea; Paulitsch, Michael; Tsymbal, Vladimir (Association for Computing Machinery (ACM), 2023)
      Conference report
      Open Access
      Autonomous Driving (AD) systems rely on object detection and tracking algorithms that require processing high volumes of data at high frequency. High-performance graphics processing units (GPUs) have been shown to provide ...
    • A symbolic emulator for shuffle synthesis on the NVIDIA PTX code 

      Matsumura, Kazuaki; García de Gonzalo, Simón; Peña Monferrer, Antonio José (Association for Computing Machinery (ACM), 2023)
      Conference report
      Open Access
      Various kinds of applications take advantage of GPUs through automation tools that attempt to automatically exploit the available performance of the GPU's parallel architecture. Directive-based programming models, such as ...
    • A unified memory approach to GPU acceleration on task based programming models 

      Rodriguez, Aimar; Beltran Querol, Vicenç (Barcelona Supercomputing Center, 2018-04-24)
      Conference report
      Open Access
    • Accelerating edit-distance sequence alignment on GPU using the wavefront algorithm 

      Aguado Puig, Quim; Marco-Sola, Santiago; Moure López, Juan Carlos; Castells Rufas, David; Álvarez Martí, Lluc; Espinosa Morales, Antonio; Moretó Planas, Miquel (Institute of Electrical and Electronics Engineers (IEEE), 2022-06-10)
      Article
      Open Access
      Sequence alignment remains a fundamental problem with practical applications ranging from pattern recognition to computational biology. Traditional algorithms based on dynamic programming are hard to parallelize, require ...
    • Accelerating K-mer Frequency Counting with GPU and Non-Volatile Memory 

      Cadenelli, Nicola; Polo Bardés, Jordà; Carrera, David (IEEE, 2018-02-15)
      Conference lecture
      Open Access
      The emergence of Next Generation Sequencing (NGS) platforms has increased the throughput of genomic sequencing and in turn the amount of data that needs to be processed, requiring highly efficient computation for its ...
    • Accelerating pairwise sequence alignment on GPUs using the Wavefront Algorithm 

      Aguado Puig, Quim (Universitat Politècnica de Catalunya, 2022-10-19)
      Master thesis
      Open Access
      Advances in genomics and sequencing technologies demand faster and more scalable analysis methods that can process longer sequences with higher accuracy. However, classical pairwise alignment methods, based on dynamic ...
    • Accelerating scientific applications on GPUs 

      Farré Gonzalez, Pau (Universitat Politècnica de Catalunya, 2016-07-04)
      Master thesis
      Open Access
      We have analyzed and accelerated two large scientific applications used at the Barcelona Supercomputer Center (BSC). With this, we want to show how two complex applications can be efficiently ported to GPUs. In addition, ...
    • Acceleration of synthetic aperture radar for on-board space systems 

      Solé i Bonet, Marc; Rodríguez Ferrández, Iván; Steenari, David; Kosmidis, Leonidas (Institute of Electrical and Electronics Engineers (IEEE), 2023)
      Conference report
      Open Access
      There is an increasing trend in modern space systems to move processing that until now was transmitted to ground for processing, on board the satellite. Synthetic Aperture Radar (SAR) is an example of such processing. ...
    • Achieving diverse redundancy for GPU Kernels 

      Alcaide Portet, Sergi; Kosmidis, Leonidas; Hernández Luz, Carles; Abella Ferrer, Jaume (Institute of Electrical and Electronics Engineers (IEEE), 2022-04)
      Article
      Open Access
      Autonomous driving requires high-performance computing devices including general-purpose CPUs as well as specific accelerators, with GPUs having a key role due to their flexibility. Safety-critical microcontrollers have ...
    • Advances in GPU architecture for deep learning and scientific computing 

      Parienté, Frédéric (Barcelona Supercomputing Center, 2016-09-10)
      Conference report
      Open Access
      The talk will cover the recent NVIDIA product announcements made at the GTC'16 conference, and how the Pascal GPU and NVLink interconnect technologies greatly improve multi-GPU performance and efficiency in deep learning ...
    • AMA: asynchronous management of accelerators for task-based programming models 

      Planas, Judit; Badia Sala, Rosa Maria; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Elsevier, 2015)
      Conference report
      Open Access
      Computational science has benefited in the last years from emerging accelerators that increase the performance of scientific simulations, but using these devices hinders the programming task. This paper presents AMA: a set ...
    • An extension of the StarSs programming model for platforms with multiple GPUs 

      Ayguadé Parra, Eduard; Badia Sala, Rosa Maria; Igual Peña, Francisco D.; Labarta Mancho, Jesús José; Mayo Gual, Rafael; Quintana Ortí, Enrique Salvador (Springer, 2009)
      Conference lecture
      Open Access
      While general-purpose homogeneous multi-core architectures are becoming ubiquitous, there are clear indications that, for a number of important applications, a better performance/power ratio can be attained using specialized ...
    • An on-board algorithm implementation on an embedded GPU: A space case study 

      Rodríguez Ferrandez, Iván; Kosmidis, Leonidas; Notebaert, Olivier; Cazorla Almeida, Francisco Javier; Steenari, David (Institute of Electrical and Electronics Engineers (IEEE), 2020)
      Conference report
      Open Access
      On-board processing requirements of future space missions are constantly increasing, calling for new hardware than the traditional ones used in space. Embedded GPUs are an attractive candidate offering both high performance ...
    • An open benchmark implementation for multi-CPU multi-GPU pedestrian detection in automotive systems 

      Trompouki, Matina M.; Kosmidis, Leonidas; Navarro, Nacho (IEEE, 2017-12-14)
      Conference lecture
      Open Access
      Modern and future automotive systems incorporate several Advanced Driving Assistance Systems (ADAS). Those systems require significant performance that cannot be provided with traditional automotive processors and programming ...
    • Assessing and improving the suitability of model-based design for GPU-accelerated railway control systems 

      Calderón Torres, Alejandro Josué; Kosmidis, Leonidas; Nicolás Ramírez, Carlos Fernando; Lasala, Javier de; Larrañaga, Ion (Springer Nature, 2021)
      Conference report
      Open Access
      Model-Based Design (MBD) is widely used for the design and simulation of electric traction control systems in the railway industry. Moreover, similar to other transportation industries, railway is moving towards the ...
    • Benchmarking CPUs and GPUs on embedded platforms for software receiver usage 

      Pany, T.; Dampf, J.; Bär, W.; Winkel, J.; Stöber, C.; Fürlinger, K.; Closas Gómez, Pau; García Molina, J. A. (2015)
      Conference report
      Open Access
      Smartphones containing multi-core central processing units (CPUs) and powerful many-core graphics processing units (GPUs) bring supercomputing technology into your pocket (or into our embedded devices). This can be exploited ...