Browsing by Author "Radojković, Petar"
Now showing items 1-20 of 32
-
Another trip to the wall: how much will stacked DRAM benefit HPC?
Radulović, Milan; Živanovič, Darko; Ruiz, Daniel; De Supinski, Bronis; McKee, Sally; Radojković, Petar; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2015)
Conference report
Restricted access - publisher's policyFirst defined two decades ago, the memory wall remains a fundamental limitation to system performance. Recent innovations in 3D-stacking technology enable DRAM devices with much higher bandwidths than traditional DIMMs. ... -
Characterizing the resource-sharing levels of the UltraSparc T2 processor
Cakarevic, Vladimir; Radojković, Petar; Verdú Mulà, Javier; Pajuelo González, Manuel Alejandro; Cazorla Almeida, Francisco Javier; Nemirovsky, Mario; Valero Cortés, Mateo (Association for Computing Machinery (ACM), 2009)
Conference report
Restricted access - publisher's policyThread level parallelism (TLP) has become a popular trend to improve processor performance, overcoming the limitations of extracting instruction level parallelism. Each TLP paradigm, such as Simultaneous Multithreading or ... -
Cost-aware prediction of uncorrected DRAM errors in the field
Boixaderas Coderch, Isaac; Živanovič, Darko; Moré Codina, Sergi; Bartolomé Rodríguez, Javier; Vicente Dorca, David; Casas, Marc; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2020)
Conference report
Open AccessThis paper presents and evaluates a method to predict DRAM uncorrected errors, a leading cause of hardware failures in large-scale HPC clusters. The method uses a random forest classifier, which was trained and evaluated ... -
Cost-aware prediction of uncorrected DRAM errors in the field
Boixaderas, Isaac; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Barcelona Supercomputing Center, 2021-05)
Conference report
Open AccessOne of the main causes of hardware failure in large-scale clusters is an uncorrected error in main memory [1]–[4]. Node failures are especially problematic in high-performance computing (HPC), where a single tightly-coupled ... -
Detailed tuning and validation of hardware simulators through microbenchmarks
Sánchez Verdejo, Rommel; Radojković, Petar (Barcelona Supercomputing Center, 2018-04-24)
Conference report
Open Access -
DRAM errors in the field: a statistical approach
Živanovič, Darko; Esmaili Dokht, Pouya; Moré, Sergi; Bartolomé, Javier; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2019)
Conference report
Open AccessThis paper summarizes our two-year study of corrected and uncor-rected errors on the MareNostrum 3 supercomputer, covering 2000 billion MB-hours of DRAM in the field. The study analyzes 4.5 million corrected and 71 uncorrected ... -
Enabling a reliable STT-MRAM main memory simulation
Asifuzzaman, Kazi; Sánchez-Verdejo, Rommel; Radojković, Petar (Association for Computing Machinery, 2017-10)
Conference lecture
Open AccessSTT-MRAM is a promising new memory technology with very desirable set of properties such as non-volatility, byte-addressability and high endurance. It has the potential to become the universal memory that could be incorporated ... -
Enabling a reliable STT-MRAM main memory simulation
Asifuzzaman, Kazi; Sánchez Verdejo, Rommel; Radojković, Petar (Barcelona Supercomputing Center, 2018-04-24)
Conference report
Open Access -
Energy efficient HPC on embedded SoCs : optimization techniques for mali GPU
Grasso, Ivan; Radojković, Petar; Rajovic, Nikola; Gelado Fernandez, Isaac; Ramírez Bellido, Alejandro (Institute of Electrical and Electronics Engineers (IEEE), 2014)
Conference report
Restricted access - publisher's policyA lot of effort from academia and industry has been invested in exploring the suitability of low-power embedded technologies for HPC. Although state-of-the-art embedded systems-on-chip (SoCs) inherently contain GPUs that ... -
HPC benchmarking: scaling right and looking beyond the average
Radulović, Milan; Asifuzzaman, Kazi; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Springer, 2018)
Conference report
Open AccessDesigning a balanced HPC system requires an understanding of the dominant performance bottlenecks. There is as yet no well established methodology for a unified evaluation of HPC systems and workloads that quantifies the ... -
Improving the effective use of multithreaded architectures : implications on compilation, thread assignment, and timing analysis
Radojković, Petar (Universitat Politècnica de Catalunya, 2013-07-19)
Doctoral thesis
Open AccessThis thesis presents cross-domain approaches that improve the effective use of multithreaded architectures. The contributions of the thesis can be classified in three groups. First, we propose several methods for thread ... -
Large-memory nodes for energy efficient high-performance computing
Živanovič, Darko; Radulović, Milan; Llort, German; Zaragoza, David; Strassburg, Janko; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2016)
Conference report
Open AccessEnergy consumption is by far the most important contributor to HPC cluster operational costs, and it accounts for a significant share of the total cost of ownership. Advanced energy-saving techniques in HPC components have ... -
Main memory in HPC: do we need more or could we live with less?
Živanovič, Darko; Radojković, Petar; Ayguadé Parra, Eduard (Barcelona Supercomputing Center, 2017-05-04)
Conference report
Open AccessThis study analyzes the memory capacity requirements of important HPC benchmarks and applications. We find that the High Performance Conjugate Gradients benchmark could be an important success story for 3D-stacked memories ... -
Main memory in HPC: do we need more, or could we live with less?
Živanovič, Darko; Pavlovic, Milan; Radulović, Milan; Shin, Hyunsung; Son, Jongpil; McKee, Sally A.; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (2017-03)
Article
Open AccessAn important aspect of High-Performance Computing (HPC) system design is the choice of main memory capacity. This choice becomes increasingly important now that 3D-stacked memories are entering the market. Compared with ... -
Main memory latency simulation: the missing link
Sánchez Verdejo, Rommel; Asifuzzaman, Kazi; Radulović, Milan; Radojković, Petar; Ayguadé Parra, Eduard; Jacob, Bruce (Association for Computing Machinery (ACM), 2018)
Conference report
Open AccessThe community accepted the need for a detailed simulation of main memory. Currently, the CPU simulators are usually coupled with the cycle-accurate main memory simulators. However, coupling CPU and memory simulators is not ... -
Mainstream vs. emerging HPC: metrics, trade-offs and lessons learned
Radulović, Milan; Asifuzzaman, Kazi; Živanovič, Darko; Rajovic, Nikola; Colin de Verdiére, Guillaume; Pleiter, Dirk; Marazakis, Manolis; Kallimanis, Nikolaos; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2018)
Conference report
Open AccessVarious servers with different characteristics and architectures are hitting the market, and their evaluation and comparison in terms of HPC features is complex and multidimensional. In this paper, we share our experience ... -
Measuring operating system overhead on CMT processors
Radojković, Petar; Cakarevic, Vladimir; Verdú Mulà, Javier; Pajuelo González, Manuel Alejandro; Gioiosa, Roberto; Cazorla Almeida, Francisco Javier; Nemirovsky, Mario; Valero Cortés, Mateo (IEEE Computer Society Publications, 2008)
Conference report
Open AccessNumerous studies have shown that Operating System (OS) noise is one of the reasons for significant performance degradation in clustered architectures. Although many studies examine the OS noise for High Performance Computing ... -
Measuring operating system overhead on Sun UltraSparc T1 processor
Radojković, Petar; Cakarevic, Vladimir; Verdú Mulà, Javier; Pajuelo González, Manuel Alejandro; Gioiosa, Roberto; Cazorla Almeida, Francisco Javier; Nemirovsky, Mario; Valero Cortés, Mateo (2009-06)
Conference report
Open AccessNumerous studies have shown that Operating System (OS) noise is one of the reasons for significant performance degradation in clustered architectures. Although many studies examine the OS noise for High Performance Computing, ... -
Microbenchmarks for detailed validation and tuning of hardware simulators
Sánchez Verdejo, Rommel; Radojković, Petar (Institute of Electrical and Electronics Engineers (IEEE), 2017)
Conference report
Restricted access - publisher's policyHardware simulators are indispensable tools for the computer architecture research. They are used by the academia and industry to prototype, explore and evaluate novel microarchitectural features. -
Overhead of the spin-lock loop in UltraSPARC T2
Cakarevic, Vladimir; Radojković, Petar; Cazorla Almeida, Francisco Javier; Gioiosa, Roberto; Nemirovsky, Mario; Valero Cortés, Mateo; Pajuelo González, Manuel Alejandro; Verdú Mulà, Javier (2008-06-04)
Conference report
Open AccessSpin locks are task synchronization mechanism used to provide mutual exclusion to shared software resources. Spin locks have a good performance in several situations over other synchronization mechanisms, i.e., when on ...