Journal articles
Recent submissions
-
Time-predictable task-to-thread mapping in multi-core processors
(Elsevier, 2024)
Article
Open access
The performance of time-predictable systems can be improved in multi-core processors using parallel programming models (e.g., OpenMP). However, schedulability analysis of parallel applications is a big challenge due to ... -
Boosting HPC data analysis performance with the ParSoDA-Py library
(Springer, 2024-02)
Article
Open access
Developing and executing large-scale data analysis applications in parallel and distributed environments can be a complex and time-consuming task. Developers often find themselves diverted from their application logic to ... -
Assessing Saiph, a task-based DSL for high-performance computational fluid dynamics
(Elsevier, 2023-10)
Article
Access restricted by publisher policy
Scientific applications face the challenge of efficiently exploiting increasingly complex parallel and distributed systems. Developing hand-tuned codes is a time-consuming, tedious and hardly reusable task. Reaching high ... -
Explaining the behaviour of reinforcement learning agents in a multi-agent cooperative environment using policy graphs
(2024-01-31)
Article
Open access
The adoption of algorithms based on Artificial Intelligence (AI) has been rapidly increasing during the last few years. However, some aspects of AI techniques are under heavy scrutiny. For instance, in many use cases, it ... -
O(n) key–value sort with active compute memory
(Institute of Electrical and Electronics Engineers (IEEE), 2024-02-29)
Article
Open access
We propose the Active Compute Memory (ACM), a near-memory-processing architecture capable of performing key–value sort directly in the DRAM. In the ACM architecture, sort is merely the writing of data into memory with one ... -
Uncertainty Management in Dependable and Intelligent Embedded Software
(Institute of Electrical and Electronics Engineers (IEEE), 2023)
Article
Open access
The development of dependable and intelligent embedded systems progresses. However, integrating complex software stacks, machine learning solutions, and high-performance computing devices amplifies the functional and ... -
The MAMe dataset: On the relevance of high resolution and variable shape image properties
(Springer, 2022-08)
Article
Open access
The most common approach in image classification tasks is to resize all images in the dataset to a unique shape, while reducing their resolution to a size that makes experimentation at scale easier. This practice has benefits ... -
Efficient data redistribution for malleable applications
(Association for Computing Machinery (ACM), 2023)
Conference paper
Open access
Process malleability can be defined as the ability of a distributed MPI parallel job to change the number of processes on-the-fly without stopping its execution, reallocating the compute resources originally assigned to ... -
WFA-FPGA: An efficient accelerator of the wavefront algorithm for short and long read genomics alignment
(Elsevier, 2023-12)
Article
Access restricted by publisher policy
In the last years, advances in genome sequencing technologies have enabled the proliferation of genomic applications that guide personalized medicine. These applications have an enormous computational cost due to the large ... -
Block size estimation for data partitioning in HPC applications using machine learning techniques
(Springer Nature, 2024-01-16)
Article
Open access
The extensive use of HPC infrastructures and frameworks for running data-intensive applications has led to a growing interest in data partitioning techniques and strategies. In fact, application performance can be heavily ... -
Fine-grained adaptive parallelism for automotive systems through AMALTHEA and OpenMP
(Elsevier, 2024-01)
Article
Access restricted by publisher policy
The software development complexity of automotive systems has significantly increased during the last decade due to the latest Advanced Driving Assistance System (ADAS) functionalities. To effectively address this complexity, ... -
Taskgraph: a low contention OpenMP tasking framework
(2023-08)
Article
Open access
OpenMP is the de-facto standard for shared memory systems in High-Performance Computing (HPC). It includes a tasking model that offers a high-level of abstraction to effectively exploit structured (loop-based) and highly ...