Ara es mostren els items 160-179 de 326

    • Java instrumentation suite: accurate analysis of Java threaded applications 

      Guitart Fernández, Jordi; Torres Viñals, Jordi; Ayguadé Parra, Eduard; Oliver, Jose; Labarta Mancho, Jesús José (2000)
      Text en actes de congrés
      Accés obert
      The rapid maturing process of the Java technology is encouraging users the development of portable applications using the Java language. As an important part of the definition of the Java language, the use of threads is ...
    • Large-memory nodes for energy efficient high-performance computing 

      Živanovič, Darko; Radulović, Milan; Llort, German; Zaragoza, David; Strassburg, Janko; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2016)
      Text en actes de congrés
      Accés obert
      Energy consumption is by far the most important contributor to HPC cluster operational costs, and it accounts for a significant share of the total cost of ownership. Advanced energy-saving techniques in HPC components have ...
    • Leveraging OmpSs to exploit hardware accelerators 

      Sainz, Florentino; Mateo Bellido, Sergi; Beltran Querol, Vicenç; Bosque, José L.; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      CUDA and OpenCL are the most widely used programming models to exploit hardware accelerators. Both programming models provide a C-based programming language to write accelerator kernels and a host API used to glue the host ...
    • Lifetime-sensitive modulo scheduling in a production environment 

      Llosa Espuny, José Francisco; Ayguadé Parra, Eduard; González Colás, Antonio María; Valero Cortés, Mateo; Eckhardt, Jason (2001-03)
      Article
      Accés obert
      This paper presents a novel software pipelining approach, which is called Swing Modulo Scheduling (SMS). It generates schedules that are near optimal in terms of initiation interval, register requirements, and stage count. ...
    • Lightning talks of EduHPC 2022 

      Qasem, Apan; Anzt, Hartwig; Ayguadé Parra, Eduard; Cahil, Katharine; Canal Corretger, Ramon; Chan, Jany; Fosler-Lussier, Eric; Llosa Espuny, José Francisco; Martorell Bofill, Xavier; Sancho Samsó, María Ribera (Institute of Electrical and Electronics Engineers (IEEE), 2022)
      Text en actes de congrés
      Accés obert
      The lightning talks at EduHPC provide an opportunity to share early results and insights on parallel and distributed computing (PDC) education and training efforts. The four lightning talks at EduHPC 2022 cover a range of ...
    • Loop level speculation in a task based programming model 

      Gayatri, Rahulkumar; Badia Sala, Rosa Maria; Ayguadé Parra, Eduard (2013)
      Text en actes de congrés
      Accés obert
      Uncountable loops (such as while loops in C) and if-conditions are some of the most common constructs in programming. While-loops are widely used to determine the convergence in linear algebra algorithms or goal finding ...
    • Loop parallelization: revisiting framework of unimodular transformations 

      Torres Viñals, Jordi; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 1996)
      Text en actes de congrés
      Accés obert
      The paper extends the framework of linear loop transformations adding a new nonlinear step at the transformation process. The current framework of linear loop transformation cannot identify a significant fraction of ...
    • Low-latency multi-threaded ensemble learning for dynamic big data streams 

      Marron, Diego; Ayguadé Parra, Eduard; Herrero Zaragoza, José Ramón; Read, Jesse; Bifet Figuerol, Albert Carles (Institute of Electrical and Electronics Engineers (IEEE), 2018)
      Text en actes de congrés
      Accés obert
      Real–time mining of evolving data streams involves new challenges when targeting today’s application domains such as the Internet of the Things: increasing volume, velocity and volatility requires data to be processed ...
    • MACC: Mercurium ACCelerator Model 

      Ozen, Guray; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Barcelona Supercomputing Center, 2015-05-05)
      Text en actes de congrés
      Accés obert
      GPU Offloading is emergent programming model. OpenMP includes in its latest 4.0 specification the accelerator model. In this paper we present a newly implementation of this specification while generationg "native" GPU ...
    • Main memory in HPC: do we need more or could we live with less? 

      Živanovič, Darko; Radojković, Petar; Ayguadé Parra, Eduard (Barcelona Supercomputing Center, 2017-05-04)
      Text en actes de congrés
      Accés obert
      This study analyzes the memory capacity requirements of important HPC benchmarks and applications. We find that the High Performance Conjugate Gradients benchmark could be an important success story for 3D-stacked memories ...
    • Main memory in HPC: do we need more, or could we live with less? 

      Živanovič, Darko; Pavlovic, Milan; Radulović, Milan; Shin, Hyunsung; Son, Jongpil; McKee, Sally A.; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (2017-03)
      Article
      Accés obert
      An important aspect of High-Performance Computing (HPC) system design is the choice of main memory capacity. This choice becomes increasingly important now that 3D-stacked memories are entering the market. Compared with ...
    • Main memory latency simulation: the missing link 

      Sánchez Verdejo, Rommel; Asifuzzaman, Kazi; Radulović, Milan; Radojković, Petar; Ayguadé Parra, Eduard; Jacob, Bruce (Association for Computing Machinery (ACM), 2018)
      Text en actes de congrés
      Accés obert
      The community accepted the need for a detailed simulation of main memory. Currently, the CPU simulators are usually coupled with the cycle-accurate main memory simulators. However, coupling CPU and memory simulators is not ...
    • Mainstream vs. emerging HPC: metrics, trade-offs and lessons learned 

      Radulović, Milan; Asifuzzaman, Kazi; Živanovič, Darko; Rajovic, Nikola; Colin de Verdiére, Guillaume; Pleiter, Dirk; Marazakis, Manolis; Kallimanis, Nikolaos; Carpenter, Paul Matthew; Radojković, Petar; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2018)
      Text en actes de congrés
      Accés obert
      Various servers with different characteristics and architectures are hitting the market, and their evaluation and comparison in terms of HPC features is complex and multidimensional. In this paper, we share our experience ...
    • Managing SLAs of heterogeneous workloads using dynamic application placement 

      Carrera Pérez, David; Steinder, Malgorzata; Whalley, Ian; Torres Viñals, Jordi; Ayguadé Parra, Eduard (ACM Press, NY, 2008)
      Text en actes de congrés
      Accés obert
      In this paper we address the problem of managing heterogeneous workloads in a virtualized data center. We consider two different workloads: transactional applications and long-running jobs. We present a technique that ...
    • MAPC: memory access pattern based controller 

      Hussain, Tassadaq; Palomar Pérez, Óscar; Unsal, Osman Sabri; Cristal Kestelman, Adrián; Ayguadé Parra, Eduard; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Traditionally, system designers have attempted to improve system performance by scheduling the processing cores and by exploring different memory system configurations and there is comparatively less work done scheduling ...
    • MAPC: memory access pattern based controller 

      Hussain, Tassadaq; Palomar Pérez, Óscar; Unsal, Osman Sabri; Cristal Kestelman, Adrián; Ayguadé Parra, Eduard; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Traditionally, system designers have attempted to improve system performance by scheduling the processing cores and by exploring different memory system configurations and there is comparatively less work done scheduling ...
    • Mapping stream programs onto heterogeneous multiprocessor systems 

      Carpenter, Paul Matthew; Ramírez Bellido, Alejandro; Ayguadé Parra, Eduard (ACM Press, NY, 2009)
      Comunicació de congrés
      Accés restringit per política de l'editorial
      This paper presents a partitioning and allocation algorithm for an iterative stream compiler, targeting heterogeneous multiprocessors with constrained distributed memory and any communications topology. We introduce a ...
    • MASA: a multi-platform architecture for sequence aligners with block pruning 

      De Sandes, Edans; Miranda, Guillermo; Martorell, Xavier; Ayguadé Parra, Eduard; Teodoro, George; de Melo, Alba (2016-03-01)
      Article
      Accés obert
      Biological sequence alignment is a very popular application in Bioinformatics used routinely worldwide. Many implementations of biological sequence alignment algorithms have been proposed for multicores, GPUs, FPGAs and ...
    • Memory controller for vector processor 

      Hussain, Tassadaq; Palomar, Oscar; Unsal, Osman Sabri; Cristal Kestelman, Adrián; Ayguadé Parra, Eduard (Springer, 2018-11)
      Article
      Accés obert
      To manage power and memory wall affects, the HPC industry supports FPGA reconfigurable accelerators and vector processing cores for data-intensive scientific applications. FPGA based vector accelerators are used to increase ...
    • MetH: A family of high-resolution and variable-shape image challenges 

      Parés Pont, Ferran; Garcia Gasulla, Dario; Servat, Harald; Labarta Mancho, Jesús José; Ayguadé Parra, Eduard (2019-11-20)
      Report de recerca
      Accés obert
      High-resolution and variable-shape images have not yet been properly addressed by the AI community. The approach of down-sampling data often used with convolutional neural networks is sub-optimal for many tasks, and has ...