• Analyzing the efficiency of hybrid codes 

      Giménez Lucas, Judit; Mercadal, Estanislao; Llort, German; Méndez, Sandra (Institute of Electrical and Electronics Engineers (IEEE), 2020)
      Text en actes de congrés
      Accés obert
      Hybrid parallelization may be the only path for most codes to use HPC systems on a very large scale. Even within a small scale, with an increasing number of cores per node, combining MPI with some shared memory thread-based ...
    • Automating the application data placement in hybrid memory systems 

      Servat, Harald; Peña, Antonio J.; Llort, German; Mercadal, Estanislao; Hoppe, Hans-Christian; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2017)
      Text en actes de congrés
      Accés obert
      Multi-tiered memory systems, such as those based on Intel® Xeon Phi™processors, are equipped with several memory tiers with different characteristics including, among others, capacity, access latency, bandwidth, energy ...
    • Bio-inspired call-stack reconstruction for performance analysis 

      Servat, Harald; Llort, German; González, Juan; Giménez Lucas, Judit; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2016)
      Text en actes de congrés
      Accés obert
      The correlation of performance bottlenecks and their associated source code has become a cornerstone of performance analysis. It allows understanding why the efficiency of an application falls behind the computer's peak ...
    • Framework for a productive performance optimization 

      Servat, Harald; Llort, German; Huck, Kevin A.; Giménez Lucas, Judit; Labarta Mancho, Jesús José (2013-08)
      Article
      Accés restringit per política de l'editorial
      Modern supercomputers deliver large computational power, but it is difficult for an application to exploit such power. One factor that limits the application performance is the single node performance. While many performance ...
    • Large-memory nodes for energy efficient high-performance computing 

      Živanovič, Darko; Radulović, Milan; Llort, German; Zaragoza, David; Strassburg, Janko; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2016)
      Text en actes de congrés
      Accés obert
      Energy consumption is by far the most important contributor to HPC cluster operational costs, and it accounts for a significant share of the total cost of ownership. Advanced energy-saving techniques in HPC components have ...
    • The secrets of the accelerators unveiled: tracing heterogeneous executions through OMPT 

      Llort, German; Filgueras Izquierdo, Antonio; Jiménez-González, Daniel; Servat, Harald; Teruel, Xavier; Mercadal, Estanislao; Álvarez, Carlos; Giménez, Judit; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Springer, 2016)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Heterogeneous systems are an important trend in the future of supercomputers, yet they can be hard to program and developers still lack powerful tools to gain understanding about how well their accelerated codes perform ...