Recent Submissions

  • Enhancing iteration performance on distributed task-based workflows 

    Barceló Cuerda, Alex; Queralt Calafat, Anna; Cortés, Toni (Elsevier, 2023-12)
    Article
    Restricted access - publisher's policy
    Task-based programming models have proven to be a robust and versatile way to approach development of applications for distributed environments. They provide natural programming patterns with high performance. However, ...
  • Space Shuttle: A test vehicle for the reliability of the SkyWater 130nm PDK for future space processors 

    Rodríguez Ferrández, Iván; Kosmidis, Leonidas; Tali, Maris; Steenari, David (Institute of Electrical and Electronics Engineers (IEEE), 2023)
    Conference lecture
    Open Access
    Recently the ASIC industry experiences a massive change with more and more small and medium businesses entering the custom ASIC development. This trend is fueled by the recent open hardware movement and relevant government ...
  • Sign language translation from instructional videos 

    Tarrés Benet, Laia; Gallego Olsina, Gerard Ion; Cardoso Duarte, Amanda; Torres Viñals, Jordi; Giró Nieto, Xavier (Computer Vision Foundation, 2023)
    Conference report
    Open Access
    The advances in automatic sign language translation (SLT) to spoken languages have been mostly benchmarked with datasets of limited size and restricted domains. Our work advances the state of the art by providing the first ...
  • Towards spatial multiplexing in wireless networks within computing packages 

    Rodríguez Galán, Fátima Yolanda; de Santana, Elana Pereira; Bolívar, Peter Haring; Abadal Cavallé, Sergi; Alarcón Cot, Eduardo José (Association for Computing Machinery (ACM), 2022)
    Conference lecture
    Open Access
    Wireless Networks-on-Chip (WNoCs) are regarded as a disruptive alternative to conventional interconnection networks at the chip scale, yet limited by the relatively low aggregate bandwidth of such wireless networks. Hence, ...
  • An energy-efficient GeMM-based convolution accelerator with on-the-fly im2col 

    Fornt Mas, Jordi; Fontova Muste, Pau; Caro Roca, Martí; Abella Ferrer, Jaume; Moll Echeto, Francisco de Borja; Altet Sanahujes, Josep; Studer, Christoph (2023-06-27)
    Article
    Open Access
    Systolic array architectures have recently emerged as successful accelerators for deep convolutional neural network (CNN) inference. Such architectures can be used to efficiently execute general matrix–matrix multiplications ...
  • Advanced synchronization techniques for task-based runtime systems 

    Álvarez Robert, David; Sala Penadés, Kevin; Maroñas Bravo, Marcos; Roca Nonell, Aleix; Beltran Querol, Vicenç (Association for Computing Machinery (ACM), 2021)
    Conference report
    Open Access
    Task-based programming models like OmpSs-2 and OpenMP provide a flexible data-flow execution model to exploit dynamic, irregular and nested parallelism. Providing an efficient implementation that scales well with small ...
  • A software-only approach to enable diverse redundancy on Intel GPUs for safety-related kernels 

    Andriotis, Nikolaos; Serrano Cases, Alejandro; Alcaide Portet, Sergi; Abella Ferrer, Jaume; Cazorla Almeida, Francisco Javier; Peng, Yang; Baldovin, Andrea; Paulitsch, Michael; Tsymbal, Vladimir (Association for Computing Machinery (ACM), 2023)
    Conference report
    Open Access
    Autonomous Driving (AD) systems rely on object detection and tracking algorithms that require processing high volumes of data at high frequency. High-performance graphics processing units (GPUs) have been shown to provide ...
  • DynAMO: Improving parallelism through dynamic placement of atomic memory operations 

    Soria Pardos, Víctor; Armejach Sanosa, Adrià; Mück, Tiago; Suárez Gracía, Dario; Joao, Jose A.; Rico, Alejandro; Moreto Planas, Miquel (Association for Computing Machinery (ACM), 2023)
    Conference report
    Open Access
    With increasing core counts in modern multi-core designs, the overhead of synchronization jeopardizes the scalability and efficiency of parallel applications. To mitigate these overheads, modern cache-coherent protocols ...
  • Tackling low-resourced sign language translation: UPC at WMT-SLT 22 

    Tarrés Benet, Laia; Gallego Olsina, Gerard Ion; Giró Nieto, Xavier; Torres Viñals, Jordi (Association for Computational Linguistics, 2022)
    Conference lecture
    Open Access
    This paper describes the system developed at the Universitat Politècnica de Catalunya for the Workshop on Machine Translation 2022 Sign Language Translation Task, in particular, for the sign-to-text direction. We use a ...
  • Lightweight register file caching in collector units for GPUs 

    Abaie Shoushtary, Mojtaba; Arnau Montañés, José María; Tubella Murgadas, Jordi; González Colás, Antonio María (Association for Computing Machinery (ACM), 2023)
    Conference report
    Open Access
    Modern GPUs benefit from a sizable Register File (RF) to provide fine-grained thread switching. As the RF is huge and accessed frequently, it consumes a considerable share of the dynamic energy of the GPU. Designing a ...
  • ProGNNosis: A data-driven model to predict GNN computation time using graph metrics 

    Wassington, Axel; Abadal Cavallé, Sergi (2022)
    Conference report
    Open Access
    Graph Neural Networks (GNN) show great promise in problems dealing with graph-structured data. One of the unique points of GNNs is their flexibility to adapt to multiple problems, which not only leads to wide applicability, ...
  • Fast behavioural RTL simulation of 10B transistor SoC designs with Metro-Mpi 

    López Paradís, Guillem; Li, Brian; Armejach Sanosa, Adrià; Wallentowitz, Stefan; Moreto Planas, Miquel; Balkind, Jonathan (Institute of Electrical and Electronics Engineers (IEEE), 2023)
    Conference report
    Open Access
    Chips with tens of billions of transistors have become today's norm. These designs are straining our electronic design automation tools throughout the design process, requiring ever more computational resources. In many ...

View more