Now showing items 1-9 of 9

  • cuHinesBatch: solving multiple hines systems on GPUs Human Brain Project 

    Valero-Lara, Pedro; Martinez-Perez, Ivan; Peña, Antonio J.; Martorell Bofill, Xavier; Sirvent, Raul; Labarta Mancho, Jesús José (Elsevier, 2017)
    Article
    Open Access
    The simulation of the behavior of the Human Brain is one of the most important challenges today in computing. The main problem consists of finding efficient ways to manipulate and compute the huge volume of data that this ...
  • MPI+OpenMP tasking scalability for multi-morphology simulations of the human brain 

    Valero-Lara, Pedro; Sirvent, Raül; Peña, Antonio J.; Labarta Mancho, Jesús José (2019-05)
    Article
    Restricted access - publisher's policy
    The simulation of the behavior of the human brain is one of the most ambitious challenges today with a non-end of important applications. We can find many different initiatives in the USA, Europe and Japan which attempt ...
  • MPI+OpenMP tasking scalability for the simulation of the human brain 

    Valero-Lara, Pedro; Sirvent, Raul; Pena, A. J.; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (Association for Computing Machinery (ACM), 2018)
    Conference report
    Open Access
    The simulation of the behavior of the Human Brain is one of the most ambitious challenges today with a non-end of important applications. We can find many different initiatives in the USA, Europe and Japan which attempt ...
  • Performance Evaluation of cuDNN Convolution Algorithms on NVIDIA Volta GPUs 

    Jordà, Marc; Valero-Lara, Pedro; Peña, Antonio J. (IEEE, 2019-05-24)
    Article
    Open Access
    Convolutional neural networks (CNNs) have recently attracted considerable attention due to their outstanding accuracy in applications, such as image recognition and natural language processing. While one advantage of the ...
  • Reducing memory requirements for large size LBM simulations on GPUs 

    Valero-Lara, Pedro (Wiley, 2017-12)
    Article
    Open Access
    The scientific community in its never-ending road of larger and more efficient computational resources is in need of more efficient implementations that can adapt efficiently on the current parallel platforms. Graphics ...
  • Simulating the behavior of the human brain on GPUS 

    Valero-Lara, Pedro; Martinez-Perez, Ivan; Sirvent, Raul; Pena, A. J.; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (2018-01-01)
    Article
    Open Access
    The simulation of the behavior of the Human Brain is one of the most important challenges in computing today. The main problem consists of finding efficient ways to manipulate and compute the huge volume of data that this ...
  • The Design and Performance of Batched BLAS on Modern High-Performance Computing Systems 

    Dongarra, Jack; Hammarling, Sven; Higham, Nicholas J.; Relton, Samuel D.; Valero-Lara, Pedro; Zounon, Mawussi (Elsevier, 2017)
    Article
    Open Access
    A current trend in high-performance computing is to decompose a large linear algebra problem into batches containing thousands of smaller problems, that can be solved independently, before collating the results. To standardize ...
  • Towards HPC-Embedded. Case Study: Kalray and Message-Passing on NoC 

    Valero-Lara, Pedro; Krishnasamy, Ezhilmathi; Jansson, Johan (Universitatea de Vest, 2017-06)
    Article
    Open Access
    Today one of the most important challenges in HPC is the development of computers with a low power consumption. In this context, recently, new embedded many-core systems have emerged. One of them is Kalray. Unlike other ...
  • Variable batched DGEMM 

    Valero-Lara, Pedro; Martinez-Perez, Ivan; Mateo, Sergio; Sirvent Pardell, Raül; Beltran Querol, Vicenç; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2018)
    Conference report
    Restricted access - publisher's policy
    Many scientific applications are in need to solve a high number of small-size independent problems. These individual problems do not provide enough parallelism and then, these must be computed as a batch. Today, vendors ...