Now showing items 1-10 of 10

    • cuHinesBatch: solving multiple hines systems on GPUs Human Brain Project 

      Valero-Lara, Pedro; Martinez-Perez, Ivan; Peña, Antonio J.; Martorell Bofill, Xavier; Sirvent, Raul; Labarta Mancho, Jesús José (Elsevier, 2017)
      Article
      Open Access
      The simulation of the behavior of the Human Brain is one of the most important challenges today in computing. The main problem consists of finding efficient ways to manipulate and compute the huge volume of data that this ...
    • MPI+OpenMP tasking scalability for multi-morphology simulations of the human brain 

      Valero-Lara, Pedro; Sirvent, Raül; Peña, Antonio J.; Labarta Mancho, Jesús José (2019-05)
      Article
      Open Access
      The simulation of the behavior of the human brain is one of the most ambitious challenges today with a non-end of important applications. We can find many different initiatives in the USA, Europe and Japan which attempt ...
    • MPI+OpenMP tasking scalability for the simulation of the human brain 

      Valero-Lara, Pedro; Sirvent, Raul; Pena, A. J.; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (Association for Computing Machinery (ACM), 2018)
      Conference report
      Open Access
      The simulation of the behavior of the Human Brain is one of the most ambitious challenges today with a non-end of important applications. We can find many different initiatives in the USA, Europe and Japan which attempt ...
    • Performance Evaluation of cuDNN Convolution Algorithms on NVIDIA Volta GPUs 

      Jordà, Marc; Valero-Lara, Pedro; Peña Monferrer, Antonio José (IEEE, 2019-05-24)
      Article
      Open Access
      Convolutional neural networks (CNNs) have recently attracted considerable attention due to their outstanding accuracy in applications, such as image recognition and natural language processing. While one advantage of the ...
    • Reducing memory requirements for large size LBM simulations on GPUs 

      Valero-Lara, Pedro (Wiley, 2017-12)
      Article
      Open Access
      The scientific community in its never-ending road of larger and more efficient computational resources is in need of more efficient implementations that can adapt efficiently on the current parallel platforms. Graphics ...
    • Simulating the behavior of the human brain on GPUS 

      Valero-Lara, Pedro; Martinez-Perez, Ivan; Sirvent, Raul; Peña, Antonio J.; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (2018-01-01)
      Article
      Open Access
      The simulation of the behavior of the Human Brain is one of the most important challenges in computing today. The main problem consists of finding efficient ways to manipulate and compute the huge volume of data that this ...
    • Tasking in accelerators: performance evaluation 

      Toledo, Leonel; Peña, Antonio J.; Catalán, Sandra; Valero-Lara, Pedro (Institute of Electrical and Electronics Engineers (IEEE), 2019)
      Conference report
      Open Access
      In this work, we analyze the implications and results of implementing dynamic parallelism, concurrent kernels and CUDA Graphs to solve task-oriented problems. As a benchmark we propose three different methods for solving ...
    • The Design and Performance of Batched BLAS on Modern High-Performance Computing Systems 

      Dongarra, Jack; Hammarling, Sven; Higham, Nicholas J.; Relton, Samuel D.; Valero-Lara, Pedro; Zounon, Mawussi (Elsevier, 2017)
      Article
      Open Access
      A current trend in high-performance computing is to decompose a large linear algebra problem into batches containing thousands of smaller problems, that can be solved independently, before collating the results. To standardize ...
    • Towards HPC-Embedded. Case Study: Kalray and Message-Passing on NoC 

      Valero-Lara, Pedro; Krishnasamy, Ezhilmathi; Jansson, Johan (Universitatea de Vest, 2017-06)
      Article
      Open Access
      Today one of the most important challenges in HPC is the development of computers with a low power consumption. In this context, recently, new embedded many-core systems have emerged. One of them is Kalray. Unlike other ...
    • Variable batched DGEMM 

      Valero-Lara, Pedro; Martinez-Perez, Ivan; Mateo, Sergio; Sirvent Pardell, Raül; Beltran Querol, Vicenç; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2018)
      Conference report
      Open Access
      Many scientific applications are in need to solve a high number of small-size independent problems. These individual problems do not provide enough parallelism and then, these must be computed as a batch. Today, vendors ...