Browsing by Subject "Graphics processing units"
-
A data-centric directive-based framework to accelerate out-of-core stencil computation on a GPU
(Institute of Electronics, Information and Communication Engineers, 2020-12-01)
Article
Open Access
Graphics processing units (GPUs) are highly efficient architectures for parallel stencil code; however, the small device (i.e., GPU) memory capacity (several tens of GBs) necessitates the use of out-of-core computation to ...
-
A low-power, high-performance speech recognition accelerator
(Institute of Electrical and Electronics Engineers (IEEE), 2019-12-01)
Article
Open Access
Automatic Speech Recognition (ASR) is becoming increasingly ubiquitous, especially in the mobile segment. Fast and accurate ASR comes at high energy cost, not being affordable for the tiny power-budgeted mobile devices. ...
-
A methodology for selective protection of matrix multiplications: A diagnostic coverage and performance trade-off for CNNs executed on GPUs
(Institute of Electrical and Electronics Engineers (IEEE), 2022)
Conference report
Open Access
The ability of CNNs to efficiently and accurately perform complex functions, such as object detection, has fostered their adoption in safety-related autonomous systems. These algorithms require high computational performance ...
-
A Novel Set of Directives for Multi-device Programming with OpenMP
(Institute of Electrical and Electronics Engineers (IEEE), 2022)
Conference report
Open Access
-
A software-only approach to enable diverse redundancy on Intel GPUs for safety-related kernels
(Association for Computing Machinery (ACM), 2023)
Conference report
Open Access
Autonomous Driving (AD) systems rely on object detection and tracking algorithms that require processing high volumes of data at high frequency. High-performance graphics processing units (GPUs) have been shown to provide ...
-
A symbolic emulator for shuffle synthesis on the NVIDIA PTX code
(Association for Computing Machinery (ACM), 2023)
Conference report
Open Access
Various kinds of applications take advantage of GPUs through automation tools that attempt to automatically exploit the available performance of the GPU's parallel architecture. Directive-based programming models, such as ...
-
A unified memory approach to GPU acceleration on task based programming models
(Barcelona Supercomputing Center, 2018-04-24)
Conference report
Open Access
-
Accelerating edit-distance sequence alignment on GPU using the wavefront algorithm
(Institute of Electrical and Electronics Engineers (IEEE), 2022-06-10)
Article
Open Access
Sequence alignment remains a fundamental problem with practical applications ranging from pattern recognition to computational biology. Traditional algorithms based on dynamic programming are hard to parallelize, require ...
-
Accelerating K-mer Frequency Counting with GPU and Non-Volatile Memory
(IEEE, 2018-02-15)
Conference lecture
Open Access
The emergence of Next Generation Sequencing (NGS) platforms has increased the throughput of genomic sequencing and in turn the amount of data that needs to be processed, requiring highly efficient computation for its ...
-
Accelerating pairwise sequence alignment on GPUs using the Wavefront Algorithm
(Universitat Politècnica de Catalunya, 2022-10-19)
Master thesis
Open Access
Advances in genomics and sequencing technologies demand faster and more scalable analysis methods that can process longer sequences with higher accuracy. However, classical pairwise alignment methods, based on dynamic ...
-
Accelerating scientific applications on GPUs
(Universitat Politècnica de Catalunya, 2016-07-04)
Master thesis
Open Access
We have analyzed and accelerated two large scientific applications used at the Barcelona Supercomputer Center (BSC). With this, we want to show how two complex applications can be efficiently ported to GPUs. In addition, ...
-
Acceleration of synthetic aperture radar for on-board space systems
(Institute of Electrical and Electronics Engineers (IEEE), 2023)
Conference report
Open Access
There is an increasing trend in modern space systems to move processing that until now was transmitted to ground for processing, on board the satellite. Synthetic Aperture Radar (SAR) is an example of such processing. ...
-
Achieving diverse redundancy for GPU Kernels
(Institute of Electrical and Electronics Engineers (IEEE), 2022-04)
Article
Open Access
Autonomous driving requires high-performance computing devices including general-purpose CPUs as well as specific accelerators, with GPUs having a key role due to their flexibility. Safety-critical microcontrollers have ...
-
Advances in GPU architecture for deep learning and scientific computing
(Barcelona Supercomputing Center, 2016-09-10)
Conference report
Open Access
The talk will cover the recent NVIDIA product announcements made at the GTC'16 conference, and how the Pascal GPU and NVLink interconnect technologies greatly improve multi-GPU performance and efficiency in deep learning ...
-
AMA: asynchronous management of accelerators for task-based programming models
(Elsevier, 2015)
Conference report
Open Access
Computational science has benefited in the last years from emerging accelerators that increase the performance of scientific simulations, but using these devices hinders the programming task. This paper presents AMA: a set ...
-
An extension of the StarSs programming model for platforms with multiple GPUs
(Springer, 2009)
Conference lecture
Open Access
While general-purpose homogeneous multi-core architectures are becoming ubiquitous, there are clear indications that, for a number of important applications, a better performance/power ratio can be attained using specialized ...
-
An on-board algorithm implementation on an embedded GPU: A space case study
(Institute of Electrical and Electronics Engineers (IEEE), 2020)
Conference report
Open Access
On-board processing requirements of future space missions are constantly increasing, calling for new hardware than the traditional ones used in space. Embedded GPUs are an attractive candidate offering both high performance ...
-
An open benchmark implementation for multi-CPU multi-GPU pedestrian detection in automotive systems
(IEEE, 2017-12-14)
Conference lecture
Open Access
Modern and future automotive systems incorporate several Advanced Driving Assistance Systems (ADAS). Those systems require significant performance that cannot be provided with traditional automotive processors and programming ...
-
Assessing and improving the suitability of model-based design for GPU-accelerated railway control systems
(Springer Nature, 2021)
Conference report
Open Access
Model-Based Design (MBD) is widely used for the design and simulation of electric traction control systems in the railway industry. Moreover, similar to other transportation industries, railway is moving towards the ...
-
Benchmarking CPUs and GPUs on embedded platforms for software receiver usage
(2015)
Conference report
Open Access
Smartphones containing multi-core central processing units (CPUs) and powerful many-core graphics processing units (GPUs) bring supercomputing technology into your pocket (or into our embedded devices). This can be exploited ...