Recent Submissions

  • An experimental study of reduced-voltage operation in modern FPGAs for neural network acceleration 

    Salami, Behzad; Onural, Erhan Baturay; Yuksel, Ismail Emir; Koc, Fahrettin; Ergin, Oguz; Cristal Kestelman, Adrián; Unsal, Osman Sabri; Sarbazi-Azad, Hamid; Mutlu, Onur (Institute of Electrical and Electronics Engineers (IEEE), 2020)
    Conference report
    Open Access
    We empirically evaluate an undervolting technique, i.e., underscaling the circuit supply voltage below the nominal level, to improve the power-efficiency of Convolutional Neural Network (CNN) accelerators mapped to Field ...
  • POSTER: SPiDRE: accelerating sparse memory access patterns 

    Barredo Ferreira, Adrián; Beard, Jonathan C.; Moreto Planas, Miquel (Institute of Electrical and Electronics Engineers (IEEE), 2019)
    Conference lecture
    Open Access
    Development in process technology has led to an exponential increase in processor speed and memory capacity. However, memory latencies have not improved as dramatically and represent a well-known problem in computer ...
  • An architecture model for a distributed virtualization system 

    Pessolani, Pablo; Tinetti, Fernando; Cortés, Toni; Gonnet, Silvio (International Academy, Research, and Industry Association (IARIA), 2018)
    Conference report
    Restricted access - publisher's policy
    This article presents an architecture model for a Distributed Virtualization System, which could expand a virtual execution environment from a single physical machine to several nodes of a cluster. With current virtualization ...
  • A toolchain to verify the parallelization of OmpSs-2 applications 

    Economo, Simone; Royuela Alcázar, Sara; Ayguadé Parra, Eduard; Beltran Querol, Vicenç (Springer, 2020)
    Conference report
    Open Access
    Programming models for task-based parallelization based on compile-time directives are very effective at uncovering the parallelism available in HPC applications. Despite that, the process of correctly annotating complex ...
  • A novel FPGA-based high throughput accelerator for binary search trees 

    Melikoglu, Oyku; Ergin, Oguz; Salami, Behzad; Pavón Rivera, Julián; Unsal, Osman Sabri; Cristal Kestelman, Adrián (Institute of Electrical and Electronics Engineers (IEEE), 2019)
    Conference report
    Open Access
    This paper presents a deeply pipelined and massively parallel Binary Search Tree (BST) accelerator for Field Programmable Gate Arrays (FPGAs). Our design relies on the extremely parallel on-chip memory, or Block RAMs (BRAMs) ...
  • Towards an auto-tuned and task-based SpMV (LASs Library) 

    Catalán Pallarés, Sandra; Usui, Tetsuzo; Toledo, Leonel; Martorell Bofill, Xavier; Labarta Mancho, Jesús José; Valero Lara, Pedro (Springer, 2020)
    Conference report
    Open Access
    We present a novel approach to parallelize the SpMV kernel included in LASs (Linear Algebra routines on OmpSs) library, after a deep review and analysis of several well-known approaches. LASs is based on OmpSs, a task-based ...
  • Predictive reliability and fault management in exascale systems: State of the art and perspectives 

    Canal Corretger, Ramon; Hernández Luz, Carles; Tornero Gavilá, Rafael; Cilardo, Alessandro; Massari, Giuseppe; Reghenzani, Federico; Fornaciari, William; Zapater Sancho, Marina; Atienza, David; Oleksiak, Ariel; Wojciech Piatek, Poznan; Abella Ferrer, Jaume (2020-09)
    Article
    Open Access
    Performance and power constraints come together with Complementary Metal Oxide Semiconductor technology scaling in future Exascale systems. Technology scaling makes each individual transistor more prone to faults and, due ...
  • An iris based lungs pre-diagnostic system 

    Hussain, Tassadaq; Haider, Amna; Muhammad, Abdul Malik; Agha, Areeb; Khan, Bilal; Rashid, Fawad; Raza, Muhammad Saad; Din, Moainud; Khan, Mehran; Ullah, Sami; Ahmed, Abdelmalik Taleb; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2019)
    Conference report
    Open Access
    Human lungs are essential respiratory organs. Different Obstructive Lung Diseases (OLD) such as bronchitis, asthma, lungs cancer etc. affects the respiration. Diagnosing OLD in the initial stage is better than diagnosing ...
  • Exceeding conservative limits: A consolidated analysis on modern hardware margins 

    Papadimitriou, George; Chatzidimitriou, Athanansios; Gizopoulos, Dimitris; Reddi, Vijay Janapa; Leng, Jingwen; Salami, Behzad; Unsal, Osman Sabri; Cristal Kestelman, Adrián (2020-06)
    Article
    Open Access
    Modern large-scale computing systems (data centers, supercomputers, cloud and edge setups and high-end cyber-physical systems) employ heterogeneous architectures that consist of multicore CPUs, general-purpose many-core ...
  • Aging-aware parallel execution 

    Medeiros, Thiarles S.; Berned, Gustavo P.; Navarro Muñoz, Antoni; Rossi, Fábio D.; Luizelli, Marcelo C.; Brandalero, Marcelo; Hübner, Michael; Beck, Antonio Carlos S.; Lorenzon, Arthur F. (Institute of Electrical and Electronics Engineers (IEEE), 2020-09-04)
    Article
    Open Access
    Computation has been pushed to the edge to decrease latency and alleviate the computational burden of the IoT applications in the cloud. However, the increasing processing demands of Edge Applications make necessary the ...
  • Asynchronous runtime with distributed manager for task-based programming models 

    Bosch Pons, Jaume; Álvarez Martínez, Carlos; Jiménez González, Daniel; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (2020-09)
    Article
    Restricted access - publisher's policy
    Parallel task-based programming models, like OpenMP, allow application developers to easily create a parallel version of their sequential codes. The standard OpenMP 4.0 introduced the possibility of describing a set of ...
  • Semi-automatic validation of cycle-accurate simulation infrastructures: The case for gem5-x86 

    Cebrián González, Juan Manuel; Barredo Ferreira, Adrián; Caminal Pallarés, Helena; Moreto Planas, Miquel; Casas Guix, Marc; Valero Cortés, Mateo (Elsevier, 2020-11)
    Article
    Restricted access - publisher's policy
    Since the early 70s, simulation infrastructures have been a keystone in computer architecture research, providing a fast and reliable way to prototype and evaluate ideas for future computing systems. There are different ...

View more