Enviaments recents

  • Time-predictable task-to-thread mapping in multi-core processors 

    Samadi, Mohammad; Royuela Alcázar, Sara; Pinho, Luis Miguel; Carvalho, Tiago; Quiñones, Eduardo (Elsevier, 2024)
    Article
    Accés obert
    The performance of time-predictable systems can be improved in multi-core processors using parallel programming models (e.g., OpenMP). However, schedulability analysis of parallel applications is a big challenge due to ...
  • Boosting HPC data analysis performance with the ParSoDA-Py library 

    Belcastro, Loris; Giampà, Salvatore; Marozzo, Fabrizio; Talia, Domenico; Trunfio, Paolo; Badia Sala, Rosa Maria; Ejarque, Jorge; Mammadli, Nihad (Springer, 2024-02)
    Article
    Accés obert
    Developing and executing large-scale data analysis applications in parallel and distributed environments can be a complex and time-consuming task. Developers often find themselves diverted from their application logic to ...
  • Assessing Saiph, a task-based DSL for high-performance computational fluid dynamics 

    Macià Sorrosal, Sandra; Martínez Ferrer, Pedro José; Ayguadé Parra, Eduard; Beltran Querol, Vicenç (Elsevier, 2023-10)
    Article
    Accés restringit per política de l'editorial
    Scientific applications face the challenge of efficiently exploiting increasingly complex parallel and distributed systems. Developing hand-tuned codes is a time-consuming, tedious and hardly reusable task. Reaching high ...
  • Explaining the behaviour of reinforcement learning agents in a multi-agent cooperative environment using policy graphs 

    Domènech Vila, Marc; Gnatyshak, Dmitry; Tormos Llorente, Adrián; Giménez Ábalos, Víctor; Álvarez Napagao, Sergio (2024-01-31)
    Article
    Accés obert
    The adoption of algorithms based on Artificial Intelligence (AI) has been rapidly increasing during the last few years. However, some aspects of AI techniques are under heavy scrutiny. For instance, in many use cases, it ...
  • O(n) key–value sort with active compute memory 

    Esmaili Dokht, Pouya; Guiot Cusido, Miquel; Radojkovic, Petar; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Adlard, Jason; Amato, Paolo; Sforzin, Marco (Institute of Electrical and Electronics Engineers (IEEE), 2024-02-29)
    Article
    Accés obert
    We propose the Active Compute Memory (ACM), a near-memory-processing architecture capable of performing key–value sort directly in the DRAM. In the ACM architecture, sort is merely the writing of data into memory with one ...
  • Uncertainty Management in Dependable and Intelligent Embedded Software 

    Perez Cerrolaza, Jon; Cazorla Almeida, Francisco Javier; Abella Ferrer, Jaume (Institute of Electrical and Electronics Engineers (IEEE), 2023)
    Article
    Accés obert
    The development of dependable and intelligent embedded systems progresses. However, integrating complex software stacks, machine learning solutions, and high-performance computing devices amplifies the functional and ...
  • The MAMe dataset: On the relevance of high resolution and variable shape image properties 

    Parés Pont, Ferran; Arias Duart, Anna; García Gasulla, Dario; Campo Francés, Gema; Viladrich Iglesias, Nina; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Springer, 2022-08)
    Article
    Accés obert
    The mostcommon approach in image classification tasks is to resize all images in the dataset to a unique shape, while reducing their resolution to a size that makes experimentation at scale easier. This practice has benefits ...
  • Efficient data redistribution for malleable applications 

    Martínez Alvarez, Iker; Aliaga, José I; Castrillo, Maribel; Iserte, Sergio (Association for Computing Machinery (ACM), 2023)
    Comunicació de congrés
    Accés obert
    Process malleability can be defined as the ability of a distributed MPI parallel job to change the number of processes on–the–fly without stopping its execution, reallocating the compute resources originally assigned to ...
  • WFA-FPGA: An efficient accelerator of the wavefront algorithm for short and long read genomics alignment 

    Haghi, Abbas; Marco-Sola, Santiago; Álvarez Martí, Lluc; Diamantopoulos, Dionysios; Hagleitner, Christoph; Moretó Planas, Miquel (Elsevier, 2023-12)
    Article
    Accés restringit per política de l'editorial
    In the last years, advances in genome sequencing technologies have enabled the proliferation of genomic applications that guide personalized medicine. These applications have an enormous computational cost due to the large ...
  • Block size estimation for data partitioning in HPC applications using machine learning techniques 

    Cantini, Riccardo; Marozzo, Fabrizio; Orsino, Alessio; Talia, Domenico; Trunfio, Paolo; Badia Sala, Rosa Maria; Ejarque Artigas, Jorge; Vázquez-Novoa, Fernando (Springer Nature, 2024-01-16)
    Article
    Accés obert
    The extensive use of HPC infrastructures and frameworks for running data-intensive applications has led to a growing interest in data partitioning techniques and strategies. In fact, application performance can be heavily ...
  • Fine-grained adaptive parallelism for automotive systems through AMALTHEA and OpenMP 

    Munera Sánchez, Adrián; Royuela Alcázar, Sara; Pressler, Michael; Mackamul, Harald; Ziegenbein, Dirk; Quiñones Moreno, Eduardo (Elsevier, 2024-01)
    Article
    Accés restringit per política de l'editorial
    The software development complexity of automotive systems has significantly increased during the last decade due to the latest Advanced Driving Assistance System (ADAS) functionalities. To effectively address this complexity, ...
  • Taskgraph: a low contention OpenMP tasking framework 

    Yu, Chenle; Royuela Alcázar, Sara; Quiñones Moreno, Eduardo (2023-08)
    Article
    Accés obert
    OpenMP is the de-facto standard for shared memory systems in High-Performance Computing (HPC). It includes a tasking model that offers a high-level of abstraction to effectively exploit structured (loop-based) and highly ...

Mostra'n més