    • Adaptive runtime-assisted block prefetching on chip-multiprocessors 

      García Flores, Víctor; Rico Carro, Alejandro; Villavieja Prados, Carlos; Carpenter, Paul M.; Navarro, Nacho; Ramirez, Alex (2016-04-29)
      Article
      Open Access
      Memory stalls are a significant source of performance degradation in modern processors. Data prefetching is a widely adopted and well studied technique used to alleviate this problem. Prefetching can be performed by the ...
    • Controlling Network Latency in Mixed Hadoop Clusters: Do We Need Active Queue Management? 

      Fischer e Silva, Renan; Carpenter, Paul M. (IEEE, 2016-12-26)
      Conference report
      Open Access
      With the advent of big data, data center applications are processing vast amounts of unstructured and semi-structured data, in parallel on large clusters, across hundreds to thousands of nodes. The highest performance for ...
    • Energy Efficient Ethernet on MapReduce Clusters: Packet Coalescing To Improve 10GbE Links 

      Fischer e Silva, Renan; Carpenter, Paul M. (IEEE, 2017-10)
      Article
      Open Access
      An important challenge of modern data centers is to reduce energy consumption, of which a substantial proportion is due to the network. Switches and NICs supporting the recent energy efficient Ethernet (EEE) standard are ...
    • Exploring interconnect energy savings under East-West traffic pattern of MapReduce clusters 

      Fischer e Silva, Renan; Carpenter, Paul M. (Institute of Electrical and Electronics Engineers (IEEE), 2015-10-26)
      Conference report
      Open Access
      An important challenge of modern data centers is to reduce energy consumption, of which a substantial proportion is due to the network. Energy Efficient Ethernet (EEE) is a recent standard that aims to ...
    • High Throughput and Low Latency on Hadoop Clusters Using Explicit Congestion Notification: The Untold Truth 

      Fischer e Silva, Renan; Carpenter, Paul M. (IEEE, 2017-09-26)
      Conference lecture
      Open Access
      Various extensions of TCP/IP have been proposed to reduce network latency; examples include Explicit Congestion Notification (ECN), Data Center TCP (DCTCP) and several proposals for Active Queue Management (AQM). Combining ...
    • Hipster: hybrid task manager for latency-critical cloud workloads 

      Nishtala, Rajiv; Carpenter, Paul M.; Petrucci, Vinicius; Martorell Bofill, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2017)
      Conference report
      Open Access
      In 2013, U.S. data centers accounted for 2.2% of the country's total electricity consumption, a figure that is projected to increase rapidly over the next decade. Many important workloads are interactive, and they demand ...
    • Interconnect Energy Savings and Lower Latency Networks in Hadoop Clusters: The Missing Link 

      Fischer e Silva, Renan; Carpenter, Paul M. (IEEE, 2017)
      Conference lecture
      Open Access
      An important challenge of modern data centres running Hadoop workloads is to minimise energy consumption, a significant proportion of which is due to the network. Significant network savings are already possible using ...
    • Large-memory nodes for energy efficient high-performance computing 

      Živanovič, Darko; Radulovic, Milan; Llort, German; Zaragoza, David; Strassburg, Janko; Carpenter, Paul M.; Radojkovic, Petar; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2016)
      Conference report
      Open Access
      Energy consumption is by far the most important contributor to HPC cluster operational costs, and it accounts for a significant share of the total cost of ownership. Advanced energy-saving techniques in HPC components have ...
    • Main memory in HPC: do we need more, or could we live with less? 

      Živanovič, Darko; Pavlovic, Milan; Radulovic, Milan; Shin, Hyunsung; Son, Jongpil; McKee, Sally A.; Carpenter, Paul M.; Radojkovic, Petar; Ayguadé Parra, Eduard (2017-03)
      Article
      Open Access
      An important aspect of High-Performance Computing (HPC) system design is the choice of main memory capacity. This choice becomes increasingly important now that 3D-stacked memories are entering the market. Compared with ...
    • PerfBound: Conserving Energy with Bounded Overheads in On/Off-Based HPC Interconnects 

      Saravanan, Karthikeyan P.; Carpenter, Paul M. (IEEE, 2018-07-01)
      Article
      Open Access
      Energy and power are key challenges in high-performance computing. System energy efficiency must be significantly improved, and this requires greater efficiency in all subcomponents. An important target of optimization is ...
    • Self-tuned software-managed energy reduction in InfiniBand links 

      Dickov, Branimir; Carpenter, Paul M.; Pericas, Miquel; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2015)
      Conference report
      Restricted access - publisher's policy
      One of the biggest challenges in high-performance computing is to reduce the power and energy consumption. Research in energy efficiency has focused mainly on energy consumption at the node level. Less attention has been ...
    • TCP Proactive Congestion Control for East–West Traffic: the Marking Threshold

      Fischer e Silva, Renan; Carpenter, Paul M. (Elsevier, 2019-03)
      Working paper
      Restricted access - publisher's policy
      Various extensions of TCP/IP have been proposed to reduce network latency; examples include Explicit Congestion Notification (ECN), Data Center TCP (DCTCP) and several proposals for Active Queue Management (AQM). Combining ...
    • Thread assignment in multicore/multithreaded processors: A statistical approach 

      Radojkovic, Petar; Carpenter, Paul M.; Moreto Planas, Miquel; Cakarevic, Vladimir; Verdú Mulà, Javier; Pajuelo González, Manuel Alejandro; Cazorla Almeida, Francisco Javier; Nemirovsky, Mario; Valero Cortés, Mateo (2016-01-01)
      Article
      Open Access
      The introduction of multicore/multithreaded processors, comprised of a large number of hardware contexts (virtual CPUs) that share resources at multiple levels, has made process scheduling, in particular assignment of ...