The group researches techniques to improve the efficiency of high-performance computing systems. It approaches this goal from several perspectives that require a certain level of cooperation and integration: uniprocessor and multiprocessor architecture, compilers, operating systems, analysis, visualisation and prediction tools, algorithms and applications. Efficiency is measured with metrics that go beyond program execution time. In particular, the group considers system design factors (cycle time, area and power consumption of the processor and memory hierarchy, and scalability of the uniprocessor and multiprocessor organisation), the functional verification of systems, the ease of use and portability of the programming model, and performance in multiuser, multiprogrammed and distributed environments, among others.

Recent Submissions

  • Complete instrumentation requirements for performance analysis of web based technologies 

    Carrera Pérez, David; Guitart Fernández, Jordi; Torres Viñals, Jordi; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2003)
    Conference report
    Open Access
    In this paper we present the eDragon environment, a research platform created to perform complete performance analysis of new Web-based technologies. eDragon enables the understanding of how application servers work in ...
  • Functional and timing implications of transient faults in critical systems 

    Kritikakou, Angeliki; Nikolaou, Panagiota; Rodríguez Ferrández, Iván; Paturel, Joseph; Kosmidis, Leonidas; Michael, Maria K.; Sentieys, Olivier; Steenari, David (Institute of Electrical and Electronics Engineers (IEEE), 2022)
    Conference report
    Open Access
    Embedded systems in critical domains, such as automotive, aviation, space domains, are often required to guarantee both functional and temporal correctness. Considering transient faults, fault analysis and mitigation ...
  • Sources of single event effects in the NVIDIA Xavier SoC family under proton irradiation 

    Rodríguez Ferrández, Iván; Tali, Maris; Kosmidis, Leonidas; Rovituso, Marta; Steenari, David (Institute of Electrical and Electronics Engineers (IEEE), 2022)
    Conference report
    Open Access
    In this paper we characterise two embedded GPU devices from the NVIDIA Xavier family System-on-Chip (SoC) using a proton beam. We compare the NVIDIA Xavier NX and Industrial devices, that respectively target commercial and ...
  • An instrumentation tool for threaded Java application servers 

    Carrera Pérez, David; Guitart Fernández, Jordi; Torres Viñals, Jordi; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Universitat de Lleida, 2002)
    Conference report
    Open Access
    Rapid development of e-business services has extended the use of application servers on companies. The Java platform has an important presence on this sector because of its portability and development facilities. Java ...
  • Towards EXtreme scale technologies and accelerators for euROhpc hw/Sw supercomputing applications for exascale: The TEXTAROSSA approach 

    Agosta, Giovanni; Aldinucci, Marco; Álvarez Martínez, Carlos; Ammendola, Roberto; Arfat, Yasir; Beaumont, Olivier; Bernaschi, Massimo; Filgueras Izquierdo, Antonio; Martorell Bofill, Xavier; Vidal, Miquel (2022-11)
    Article
    Restricted access - publisher's policy
    In the near future, Exascale systems will need to bridge three technology gaps to achieve high performance while remaining under tight power constraints: energy efficiency and thermal control; extreme computation efficiency ...
  • OmpSs@cloudFPGA: An FPGA task-based programming model with message passing 

    Haro Ruiz, Juan Miguel de; Cano, Rubén; Álvarez Martínez, Carlos; Jiménez González, Daniel; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Abel, François; Ringlein, Burkhard; Weiss, Beat (Institute of Electrical and Electronics Engineers (IEEE), 2022)
    Conference report
    Open Access
    Nowadays, a new parallel paradigm for energy-efficient heterogeneous hardware infrastructures is required to achieve better performance at a reasonable cost on high-performance computing applications. Under this new paradigm, ...
  • Vector extensions in COTS processors to increase guaranteed performance in real-time systems 

    Pujol Torramorell, Roger; Jorba Jorba, Josep; Tabani, Hamid; Kosmidis, Leonidas; Mezzetti, Enrico; Abella Ferrer, Jaume; Cazorla Almeida, Francisco Javier (2022-08-31)
    Article
    Open Access
    The need for increased application performance in high-integrity systems like those in avionics is on the rise as software continues to implement more complex functionalities. The prevalent computing solution for future ...
  • Accelerating FFT using NEC SX-Aurora vector engine 

    Vizcaino Serrano, Pablo; Mantovani, Filippo; Labarta Mancho, Jesús José (Springer, 2021)
    Conference report
    Restricted access - publisher's policy
    Novel architectures leveraging long and variable vector lengths like the NEC SX-Aurora or the vector extension of RISCV are appearing as promising solutions on the supercomputing market. These architectures often require ...
  • A closer look at referring expressions for video object segmentation 

    Bellver Bueno, Míriam; Ventura Royo, Carles; Silberer, Carina; Kazakos, Ioannis; Torres Viñals, Jordi; Giró Nieto, Xavier (2022-07-27)
    Article
    Open Access
    The task of Language-guided Video Object Segmentation (LVOS) aims at generating binary masks for an object referred by a linguistic expression. When this expression unambiguously describes an object in the scene, it is ...
  • The holistic perspective of the INCISIVE Project: artificial intelligence in screening mammography 

    Lazic, Ivan; Agulló López, Ferran; Aussó Trias, Susanna; Alves, Bruno; Barelle, Caroline; Berral García, Josep Lluís; Bizopoulos, Paschalis; Bunduc, Oana; Gutiérrez Torre, Alberto (Multidisciplinary Digital Publishing Institute, 2022-08-31)
    Article
    Open Access
    Finding new ways to cost-effectively facilitate population screening and improve cancer diagnoses at an early stage supported by data-driven AI models provides unprecedented opportunities to reduce cancer related mortality. ...
  • TEXTAROSSA: Towards EXtreme scale Technologies and Accelerators for euROhpc hw/Sw Supercomputing Applications for exascale 

    Agosta, Giovanni; Cattaneo, Daniele; Fornaciari, William; Galimberti, Andrea; Massari, Giuseppe; Reghenzani, Federico; Terraneo, Federico; Filgueras Izquierdo, Antonio; Vidal, Miquel; Álvarez Martínez, Carlos; Martorell Bofill, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2021)
    Conference report
    Open Access
    To achieve high performance and high energy efficiency on near-future exascale computing systems, three key technology gaps need to be bridged. These gaps include: energy efficiency and thermal control; extreme computation ...
  • A BF16 FMA is all you need for DNN training 

    Osorio Ríos, John Haiber; Armejach Sanosa, Adrià; Petit, Eric; Henry, Greg; Casas Guix, Marc (Institute of Electrical and Electronics Engineers (IEEE), 2022-07-01)
    Article
    Open Access
    Fused Multiply-Add (FMA) functional units constitute a fundamental hardware component to train Deep Neural Networks (DNNs). Its silicon area grows quadratically with the mantissa bit count of the computer number format, ...
