Doctorat en Arquitectura de Computadors
Collections in this community
Journal articles [152]
Recent Submissions
Enhancing iteration performance on distributed task-based workflows
(Elsevier, 2023-12)
Article
Restricted access - publisher's policy
Task-based programming models have proven to be a robust and versatile way to approach development of applications for distributed environments. They provide natural programming patterns with high performance. However, ...
Space Shuttle: A test vehicle for the reliability of the SkyWater 130nm PDK for future space processors
(Institute of Electrical and Electronics Engineers (IEEE), 2023)
Conference lecture
Open Access
Recently the ASIC industry experiences a massive change with more and more small and medium businesses entering the custom ASIC development. This trend is fueled by the recent open hardware movement and relevant government ...
Sign language translation from instructional videos
(Computer Vision Foundation, 2023)
Conference report
Open Access
The advances in automatic sign language translation (SLT) to spoken languages have been mostly benchmarked with datasets of limited size and restricted domains. Our work advances the state of the art by providing the first ...
Towards spatial multiplexing in wireless networks within computing packages
(Association for Computing Machinery (ACM), 2022)
Conference lecture
Open Access
Wireless Networks-on-Chip (WNoCs) are regarded as a disruptive alternative to conventional interconnection networks at the chip scale, yet limited by the relatively low aggregate bandwidth of such wireless networks. Hence, ...
An energy-efficient GeMM-based convolution accelerator with on-the-fly im2col
(2023-06-27)
Article
Open Access
Systolic array architectures have recently emerged as successful accelerators for deep convolutional neural network (CNN) inference. Such architectures can be used to efficiently execute general matrix–matrix multiplications ...
Advanced synchronization techniques for task-based runtime systems
(Association for Computing Machinery (ACM), 2021)
Conference report
Open Access
Task-based programming models like OmpSs-2 and OpenMP provide a flexible data-flow execution model to exploit dynamic, irregular and nested parallelism. Providing an efficient implementation that scales well with small ...
A software-only approach to enable diverse redundancy on Intel GPUs for safety-related kernels
(Association for Computing Machinery (ACM), 2023)
Conference report
Open Access
Autonomous Driving (AD) systems rely on object detection and tracking algorithms that require processing high volumes of data at high frequency. High-performance graphics processing units (GPUs) have been shown to provide ...
DynAMO: Improving parallelism through dynamic placement of atomic memory operations
(Association for Computing Machinery (ACM), 2023)
Conference report
Open Access
With increasing core counts in modern multi-core designs, the overhead of synchronization jeopardizes the scalability and efficiency of parallel applications. To mitigate these overheads, modern cache-coherent protocols ...
Tackling low-resourced sign language translation: UPC at WMT-SLT 22
(Association for Computational Linguistics, 2022)
Conference lecture
Open Access
This paper describes the system developed at the Universitat Politècnica de Catalunya for the Workshop on Machine Translation 2022 Sign Language Translation Task, in particular, for the sign-to-text direction. We use a ...
Lightweight register file caching in collector units for GPUs
(Association for Computing Machinery (ACM), 2023)
Conference report
Open Access
Modern GPUs benefit from a sizable Register File (RF) to provide fine-grained thread switching. As the RF is huge and accessed frequently, it consumes a considerable share of the dynamic energy of the GPU. Designing a ...
ProGNNosis: A data-driven model to predict GNN computation time using graph metrics
(2022)
Conference report
Open Access
Graph Neural Networks (GNN) show great promise in problems dealing with graph-structured data. One of the unique points of GNNs is their flexibility to adapt to multiple problems, which not only leads to wide applicability, ...
Fast behavioural RTL simulation of 10B transistor SoC designs with Metro-Mpi
(Institute of Electrical and Electronics Engineers (IEEE), 2023)
Conference report
Open Access
Chips with tens of billions of transistors have become today's norm. These designs are straining our electronic design automation tools throughout the design process, requiring ever more computational resources. In many ...