Ara es mostren els items 1-20 de 32

    • Another trip to the wall: how much will stacked DRAM benefit HPC? 

      Radulović, Milan; Živanovič, Darko; Ruiz, Daniel; De Supinski, Bronis; McKee, Sally; Radojković, Petar; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2015)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      First defined two decades ago, the memory wall remains a fundamental limitation to system performance. Recent innovations in 3D-stacking technology enable DRAM devices with much higher bandwidths than traditional DIMMs. ...
    • Characterizing the resource-sharing levels of the UltraSparc T2 processor 

      Cakarevic, Vladimir; Radojković, Petar; Verdú Mulà, Javier; Pajuelo González, Manuel Alejandro; Cazorla Almeida, Francisco Javier; Nemirovsky, Mario; Valero Cortés, Mateo (Association for Computing Machinery (ACM), 2009)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Thread level parallelism (TLP) has become a popular trend to improve processor performance, overcoming the limitations of extracting instruction level parallelism. Each TLP paradigm, such as Simultaneous Multithreading or ...
    • Cost-aware prediction of uncorrected DRAM errors in the field 

      Boixaderas Coderch, Isaac; Živanovič, Darko; Moré Codina, Sergi; Bartolomé Rodríguez, Javier; Vicente Dorca, David; Casas, Marc; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2020)
      Text en actes de congrés
      Accés obert
      This paper presents and evaluates a method to predict DRAM uncorrected errors, a leading cause of hardware failures in large-scale HPC clusters. The method uses a random forest classifier, which was trained and evaluated ...
    • Cost-aware prediction of uncorrected DRAM errors in the field 

      Boixaderas, Isaac; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Barcelona Supercomputing Center, 2021-05)
      Text en actes de congrés
      Accés obert
      One of the main causes of hardware failure in large-scale clusters is an uncorrected error in main memory [1]–[4]. Node failures are especially problematic in high-performance computing (HPC), where a single tightly-coupled ...
    • Detailed tuning and validation of hardware simulators through microbenchmarks 

      Sánchez Verdejo, Rommel; Radojković, Petar (Barcelona Supercomputing Center, 2018-04-24)
      Text en actes de congrés
      Accés obert
    • DRAM errors in the field: a statistical approach 

      Živanovič, Darko; Esmaili Dokht, Pouya; Moré, Sergi; Bartolomé, Javier; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2019)
      Text en actes de congrés
      Accés obert
      This paper summarizes our two-year study of corrected and uncor-rected errors on the MareNostrum 3 supercomputer, covering 2000 billion MB-hours of DRAM in the field. The study analyzes 4.5 million corrected and 71 uncorrected ...
    • Enabling a reliable STT-MRAM main memory simulation 

      Asifuzzaman, Kazi; Sánchez-Verdejo, Rommel; Radojković, Petar (Association for Computing Machinery, 2017-10)
      Comunicació de congrés
      Accés obert
      STT-MRAM is a promising new memory technology with very desirable set of properties such as non-volatility, byte-addressability and high endurance. It has the potential to become the universal memory that could be incorporated ...
    • Enabling a reliable STT-MRAM main memory simulation 

      Asifuzzaman, Kazi; Sánchez Verdejo, Rommel; Radojković, Petar (Barcelona Supercomputing Center, 2018-04-24)
      Text en actes de congrés
      Accés obert
    • Energy efficient HPC on embedded SoCs : optimization techniques for mali GPU 

      Grasso, Ivan; Radojković, Petar; Rajovic, Nikola; Gelado Fernandez, Isaac; Ramírez Bellido, Alejandro (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      A lot of effort from academia and industry has been invested in exploring the suitability of low-power embedded technologies for HPC. Although state-of-the-art embedded systems-on-chip (SoCs) inherently contain GPUs that ...
    • HPC benchmarking: scaling right and looking beyond the average 

      Radulović, Milan; Asifuzzaman, Kazi; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Springer, 2018)
      Text en actes de congrés
      Accés obert
      Designing a balanced HPC system requires an understanding of the dominant performance bottlenecks. There is as yet no well established methodology for a unified evaluation of HPC systems and workloads that quantifies the ...
    • Improving the effective use of multithreaded architectures : implications on compilation, thread assignment, and timing analysis 

      Radojković, Petar (Universitat Politècnica de Catalunya, 2013-07-19)
      Tesi
      Accés obert
      This thesis presents cross-domain approaches that improve the effective use of multithreaded architectures. The contributions of the thesis can be classified in three groups. First, we propose several methods for thread ...
    • Large-memory nodes for energy efficient high-performance computing 

      Živanovič, Darko; Radulović, Milan; Llort, German; Zaragoza, David; Strassburg, Janko; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2016)
      Text en actes de congrés
      Accés obert
      Energy consumption is by far the most important contributor to HPC cluster operational costs, and it accounts for a significant share of the total cost of ownership. Advanced energy-saving techniques in HPC components have ...
    • Main memory in HPC: do we need more or could we live with less? 

      Živanovič, Darko; Radojković, Petar; Ayguadé Parra, Eduard (Barcelona Supercomputing Center, 2017-05-04)
      Text en actes de congrés
      Accés obert
      This study analyzes the memory capacity requirements of important HPC benchmarks and applications. We find that the High Performance Conjugate Gradients benchmark could be an important success story for 3D-stacked memories ...
    • Main memory in HPC: do we need more, or could we live with less? 

      Živanovič, Darko; Pavlovic, Milan; Radulović, Milan; Shin, Hyunsung; Son, Jongpil; McKee, Sally A.; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (2017-03)
      Article
      Accés obert
      An important aspect of High-Performance Computing (HPC) system design is the choice of main memory capacity. This choice becomes increasingly important now that 3D-stacked memories are entering the market. Compared with ...
    • Main memory latency simulation: the missing link 

      Sánchez Verdejo, Rommel; Asifuzzaman, Kazi; Radulović, Milan; Radojković, Petar; Ayguadé Parra, Eduard; Jacob, Bruce (Association for Computing Machinery (ACM), 2018)
      Text en actes de congrés
      Accés obert
      The community accepted the need for a detailed simulation of main memory. Currently, the CPU simulators are usually coupled with the cycle-accurate main memory simulators. However, coupling CPU and memory simulators is not ...
    • Mainstream vs. emerging HPC: metrics, trade-offs and lessons learned 

      Radulović, Milan; Asifuzzaman, Kazi; Živanovič, Darko; Rajovic, Nikola; Colin de Verdiére, Guillaume; Pleiter, Dirk; Marazakis, Manolis; Kallimanis, Nikolaos; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2018)
      Text en actes de congrés
      Accés obert
      Various servers with different characteristics and architectures are hitting the market, and their evaluation and comparison in terms of HPC features is complex and multidimensional. In this paper, we share our experience ...
    • Measuring operating system overhead on CMT processors 

      Radojković, Petar; Cakarevic, Vladimir; Verdú Mulà, Javier; Pajuelo González, Manuel Alejandro; Gioiosa, Roberto; Cazorla Almeida, Francisco Javier; Nemirovsky, Mario; Valero Cortés, Mateo (IEEE Computer Society Publications, 2008)
      Text en actes de congrés
      Accés obert
      Numerous studies have shown that Operating System (OS) noise is one of the reasons for significant performance degradation in clustered architectures. Although many studies examine the OS noise for High Performance Computing ...
    • Measuring operating system overhead on Sun UltraSparc T1 processor 

      Radojković, Petar; Cakarevic, Vladimir; Verdú Mulà, Javier; Pajuelo González, Manuel Alejandro; Gioiosa, Roberto; Cazorla Almeida, Francisco Javier; Nemirovsky, Mario; Valero Cortés, Mateo (2009-06)
      Text en actes de congrés
      Accés obert
      Numerous studies have shown that Operating System (OS) noise is one of the reasons for significant performance degradation in clustered architectures. Although many studies examine the OS noise for High Performance Computing, ...
    • Microbenchmarks for detailed validation and tuning of hardware simulators 

      Sánchez Verdejo, Rommel; Radojković, Petar (Institute of Electrical and Electronics Engineers (IEEE), 2017)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Hardware simulators are indispensable tools for the computer architecture research. They are used by the academia and industry to prototype, explore and evaluate novel microarchitectural features.
    • Overhead of the spin-lock loop in UltraSPARC T2 

      Cakarevic, Vladimir; Radojković, Petar; Cazorla Almeida, Francisco Javier; Gioiosa, Roberto; Nemirovsky, Mario; Valero Cortés, Mateo; Pajuelo González, Manuel Alejandro; Verdú Mulà, Javier (2008-06-04)
      Text en actes de congrés
      Accés obert
      Spin locks are task synchronization mechanism used to provide mutual exclusion to shared software resources. Spin locks have a good performance in several situations over other synchronization mechanisms, i.e., when on ...