Now showing items 1-9 of 9

    • Contention-aware application performance prediction for disaggregated memory systems 

      Vieira Zacarias, Felippe; Nishtala, Rajiv; Carpenter, Paul Matthew (Association for Computing Machinery (ACM), 2020)
      Conference report
      Open Access
      Disaggregated memory has recently been proposed as a way to allow flexible and fine-grained allocation of memory capacity to compute jobs. This paper makes an important step towards effective resource allocation on ...
    • Continuous-action reinforcement learning for memory allocation in virtualized servers 

      Garrido Platero, Luis Ángel; Nishtala, Rajiv; Carpenter, Paul Matthew (Springer, 2019)
      Conference report
      Open Access
      In a virtualized computing server (node) with multiple Virtual Machines (VMs), it is necessary to dynamically allocate memory among the VMs. In many cases, this is done only considering the memory demand of each VM without ...
    • DRAM errors in the field: a statistical approach 

      Živanovič, Darko; Esmaili Dokht, Pouya; Moré, Sergi; Bartolomé, Javier; Carpenter, Paul Matthew; Radojkovic, Petar; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2019)
      Conference report
      Open Access
      This paper summarizes our two-year study of corrected and uncor-rected errors on the MareNostrum 3 supercomputer, covering 2000 billion MB-hours of DRAM in the field. The study analyzes 4.5 million corrected and 71 uncorrected ...
    • HPC benchmarking: scaling right and looking beyond the average 

      Radulovic, Milan; Asifuzzaman, Kazi; Carpenter, Paul Matthew; Radojkovic, Petar; Ayguadé Parra, Eduard (Springer, 2018)
      Conference report
      Open Access
      Designing a balanced HPC system requires an understanding of the dominant performance bottlenecks. There is as yet no well established methodology for a unified evaluation of HPC systems and workloads that quantifies the ...
    • LEGaTO: Low-energy, secure, and resilient toolset for heterogeneous computing 

      Salami, Behzad; Parasyris, Konstantinos; Cristal Kestelman, Adrián; Unsal, Osman Sabri; Martorell Bofill, Xavier; Carpenter, Paul Matthew; De la Cruz Martínez, Raul; Bautista Gomez, Leonardo Arturo; Jiménez González, Daniel; Álvarez Martínez, Carlos; Nabavilarimi, Seyed Saber; Madonar Soria, Sergi (Institute of Electrical and Electronics Engineers (IEEE), 2020)
      Conference report
      Open Access
      The LEGaTO project leverages task-based programming models to provide a software ecosystem for Made in-Europe heterogeneous hardware composed of CPUs, GPUs, FPGAs and dataflow engines. The aim is to attain one order of ...
    • Mainstream vs. emerging HPC: metrics, trade-offs and lessons learned 

      Radulović, Milan; Asifuzzaman, Kazi; Živanovič, Darko; Rajovic, Nikola; Colin de Verdiére, Guillaume; Pleiter, Dirk; Marazakis, Manolis; Kallimanis, Nikolaos; Carpenter, Paul Matthew; Radojkovic, Petar; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2018)
      Conference report
      Open Access
      Various servers with different characteristics and architectures are hitting the market, and their evaluation and comparison in terms of HPC features is complex and multidimensional. In this paper, we share our experience ...
    • PROFET: modeling system performance and energy without simulating the CPU 

      Radulovic, Milan; Sánchez-Verdejo, Rommel; Carpenter, Paul Matthew; Radojkovic, Petar; Jacob, Bruce; Ayguadé Parra, Eduard (2019-06)
      Article
      Open Access
      The approaching end of DRAM scaling and expansion of emerging memory technologies is motivating a lot of research in future memory systems. Novel memory systems are typically explored by hardware simulators that are slow ...
    • The hipster approach for improving cloud system efficiency 

      Nishtala, Rajiv; Carpenter, Paul Matthew; Petrucci, Vinicius; Martorell Bofill, Xavier (2017-12-29)
      Article
      Restricted access - publisher's policy
      In 2013, U.S. data centers accounted for 2.2% of the country’s total electricity consumption, a figure that is projected to increase rapidly over the next decade. Many important data center workloads in cloud computing are ...
    • Towards resilient EU HPC systems: A blueprint 

      Radojkovic, Petar; Marazakis, Manolis; Carpenter, Paul Matthew; Jeyapaul, Reiley; Gizopoulos, Dimitris; Schulz, Martin; Armejach Sanosa, Adrià; Ayguadé Parra, Eduard; Canal Corretger, Ramon; Moreto Planas, Miquel; Salami, Behzad; Unsal, Osman Sabri (2020-04)
      External research report
      Open Access
      This document aims to spearhead a Europe-wide discussion on HPC system resilience and to help the European HPC community define best practices for resilience. We analyse a wide range of state-of-the-art resilience mechanisms ...