Browsing by Author "Valero-Lara, Pedro"
Now showing items 1-10 of 10
-
cuHinesBatch: solving multiple hines systems on GPUs Human Brain Project
Valero-Lara, Pedro; Martinez-Perez, Ivan; Peña, Antonio J.; Martorell Bofill, Xavier; Sirvent, Raul; Labarta Mancho, Jesús José (Elsevier, 2017)
Article
Open AccessThe simulation of the behavior of the Human Brain is one of the most important challenges today in computing. The main problem consists of finding efficient ways to manipulate and compute the huge volume of data that this ... -
MPI+OpenMP tasking scalability for multi-morphology simulations of the human brain
Valero-Lara, Pedro; Sirvent, Raül; Peña, Antonio J.; Labarta Mancho, Jesús José (2019-05)
Article
Open AccessThe simulation of the behavior of the human brain is one of the most ambitious challenges today with a non-end of important applications. We can find many different initiatives in the USA, Europe and Japan which attempt ... -
MPI+OpenMP tasking scalability for the simulation of the human brain
Valero-Lara, Pedro; Sirvent, Raul; Pena, A. J.; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (Association for Computing Machinery (ACM), 2018)
Conference report
Open AccessThe simulation of the behavior of the Human Brain is one of the most ambitious challenges today with a non-end of important applications. We can find many different initiatives in the USA, Europe and Japan which attempt ... -
Performance Evaluation of cuDNN Convolution Algorithms on NVIDIA Volta GPUs
Jordà, Marc; Valero-Lara, Pedro; Peña Monferrer, Antonio José (IEEE, 2019-05-24)
Article
Open AccessConvolutional neural networks (CNNs) have recently attracted considerable attention due to their outstanding accuracy in applications, such as image recognition and natural language processing. While one advantage of the ... -
Reducing memory requirements for large size LBM simulations on GPUs
Valero-Lara, Pedro (Wiley, 2017-12)
Article
Open AccessThe scientific community in its never-ending road of larger and more efficient computational resources is in need of more efficient implementations that can adapt efficiently on the current parallel platforms. Graphics ... -
Simulating the behavior of the human brain on GPUS
Valero-Lara, Pedro; Martinez-Perez, Ivan; Sirvent, Raul; Peña, Antonio J.; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (2018-01-01)
Article
Open AccessThe simulation of the behavior of the Human Brain is one of the most important challenges in computing today. The main problem consists of finding efficient ways to manipulate and compute the huge volume of data that this ... -
Tasking in accelerators: performance evaluation
Toledo, Leonel; Peña, Antonio J.; Catalán, Sandra; Valero-Lara, Pedro (Institute of Electrical and Electronics Engineers (IEEE), 2019)
Conference report
Open AccessIn this work, we analyze the implications and results of implementing dynamic parallelism, concurrent kernels and CUDA Graphs to solve task-oriented problems. As a benchmark we propose three different methods for solving ... -
The Design and Performance of Batched BLAS on Modern High-Performance Computing Systems
Dongarra, Jack; Hammarling, Sven; Higham, Nicholas J.; Relton, Samuel D.; Valero-Lara, Pedro; Zounon, Mawussi (Elsevier, 2017)
Article
Open AccessA current trend in high-performance computing is to decompose a large linear algebra problem into batches containing thousands of smaller problems, that can be solved independently, before collating the results. To standardize ... -
Towards HPC-Embedded. Case Study: Kalray and Message-Passing on NoC
Valero-Lara, Pedro; Krishnasamy, Ezhilmathi; Jansson, Johan (Universitatea de Vest, 2017-06)
Article
Open AccessToday one of the most important challenges in HPC is the development of computers with a low power consumption. In this context, recently, new embedded many-core systems have emerged. One of them is Kalray. Unlike other ... -
Variable batched DGEMM
Valero-Lara, Pedro; Martinez-Perez, Ivan; Mateo, Sergio; Sirvent Pardell, Raül; Beltran Querol, Vicenç; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2018)
Conference report
Open AccessMany scientific applications are in need to solve a high number of small-size independent problems. These individual problems do not provide enough parallelism and then, these must be computed as a batch. Today, vendors ...