
    • Boosting LSTM performance through dynamic precision selection 

      Silfa Feliz, Franyell Antonio; Arnau Montañés, José María; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2020)
      Conference report
      Open Access
      The use of low numerical precision is a fundamental optimization included in modern accelerators for Deep Neural Networks (DNNs). The number of bits of the numerical representation is set to the minimum precision that is ...
    • LAWS: Locality-AWare Scheme for automatic speech recognition 

      Yazdani Aminabadi, Reza; Arnau Montañés, José María; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2020-08-01)
      Article
      Open Access
      Automatic Speech Recognition (ASR) systems are changing the way people interact with different applications on mobile devices. Fulfilling such user-interactivity requires not only a highly accurate, large-vocabulary ...
    • Design and evaluation of an ultra low-power human-quality speech recognition system 

      Pinto Rivero, Daniel; Arnau Montañés, José María; González Colás, Antonio María (2020-11)
      Article
      Open Access
      Automatic Speech Recognition (ASR) has experienced a dramatic evolution since the pioneering development of Bell Labs' single-digit recognizer more than 50 years ago. Current ASR systems have taken advantage of the tremendous ...
    • Demystifying power and performance bottlenecks in autonomous driving systems 

      Exenberger Becker, Pedro Henrique; Arnau Montañés, José María; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2020)
      Conference report
      Open Access
      Autonomous Vehicles (AVs) have the potential to radically change the automotive industry. However, computing solutions for AVs have to meet severe performance and power constraints to guarantee a safe driving experience. ...
    • Tendencias en la microarquitectura de los procesadores 

      González Colás, Antonio María (Asociación de Técnicos de Informática, 2000-05)
      Article
      Open Access
      This article reviews the microarchitecture of current processors. It then presents the main expectations for the evolution of the technology. Next, it highlights the main limitations ...
    • DRAM errors in the field: a statistical approach 

      Živanovič, Darko; Esmaili Dokht, Pouya; Moré, Sergi; Bartolomé, Javier; Carpenter, Paul Matthew; Radojkovic, Petar; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2019)
      Conference report
      Open Access
      This paper summarizes our two-year study of corrected and uncorrected errors on the MareNostrum 3 supercomputer, covering 2000 billion MB-hours of DRAM in the field. The study analyzes 4.5 million corrected and 71 uncorrected ...
    • Multi-objective interior design optimization method based on sustainability concepts for post-disaster temporary housing units 

      Hosseini, Seyed Mohammad Amin; Yazdani Aminabadi, Reza; Fuente Antequera, Albert de la (2020-04)
      Article
      Restricted access - publisher's policy
      Temporary housing units (THUs), which are provided after disasters, are crucial in terms of sustainability pillars (economic, social, and environmental). In general, THUs, which are regular houses with minimum space and ...
    • Neuron-level fuzzy memoization in RNNs 

      Silfa Feliz, Franyell Antonio; Dot Artigas, Gem; Arnau Montañés, José María; González Colás, Antonio María (Association for Computing Machinery (ACM), 2019)
      Conference report
      Open Access
      Recurrent Neural Networks (RNNs) are a key technology for applications such as automatic speech recognition or machine translation. Unlike conventional feed-forward DNNs, RNNs remember past information to improve the ...
    • Leveraging run-time feedback for efficient ASR acceleration 

      Yazdani, Reza; Arnau Montañés, José María; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2019)
      Conference report
      Open Access
      In this work, we propose Locality-AWare-Scheme (LAWS) for an Automatic Speech Recognition (ASR) accelerator in order to significantly reduce its energy consumption and memory requirements, by leveraging the locality among ...
    • SCU: a GPU stream compaction unit for graph processing 

      Segura Salvador, Albert; Arnau Montañés, José María; González Colás, Antonio María (Association for Computing Machinery (ACM), 2019)
      Conference report
      Restricted access - publisher's policy
      Graph processing algorithms are key in many emerging applications in areas such as machine learning and data analytics. Although the processing of large scale graphs exhibits a high degree of parallelism, the memory access ...
    • A low-power, high-performance speech recognition accelerator 

      Yazdani, Reza; Arnau Montañés, José María; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2019-12-01)
      Article
      Open Access
      Automatic Speech Recognition (ASR) is becoming increasingly ubiquitous, especially in the mobile segment. Fast and accurate ASR comes at high energy cost, not being affordable for the tiny power-budgeted mobile devices. ...
    • CGPA: Coarse-Grained Pruning of Activations for Energy-Efficient RNN Inference 

      Riera Villanueva, Marc; Arnau Montañés, José María; González Colás, Antonio María (2019-09-01)
      Article
      Open Access
      Recurrent neural networks (RNNs) perform element-wise multiplications across the activations of gates. We show that a significant percentage of activations are saturated and propose coarse-grained pruning of activations ...