Recent Submissions

  • Performance Evaluation of cuDNN Convolution Algorithms on NVIDIA Volta GPUs 

    Jordà, Marc; Valero-Lara, Pedro; Peña, Antonio J. (IEEE, 2019-05-24)
    Article
    Open Access
    Convolutional neural networks (CNNs) have recently attracted considerable attention due to their outstanding accuracy in applications, such as image recognition and natural language processing. While one advantage of the ...
  • Probabilistic Worst-Case Timing Analysis: Taxonomy and Comprehensive Survey 

    Cazorla, Francisco J.; Kosmidis, Leonidas; Mezzetti, Enrico; Hernandez, Carles; Abella, Jaume; Vardanega, Tullio (ACM, 2019-02-01)
    Article
    Open Access
    The unabated increase in the complexity of the hardware and software components of modern embedded real-time systems has given momentum to a host of research in the use of probabilistic and statistical techniques for timing ...
  • Considerations in using OpenCL on GPUs and FPGAs for throughput-oriented genomics workloads 

    Cadenelli, Nicola; Jaksic, Zoran; Polo, Jordà; Carrera, David (Elsevier, 2019-05)
    Article
    Open Access
    The recent upsurge in the available amount of health data and the advances in next-generation sequencing are setting the ground for the long-awaited precision medicine. To process this deluge of data, bioinformatics workloads ...
  • High-Integrity Performance Monitoring Units in Automotive Chips for Reliable Timing V&V 

    Mezzetti, Enrico; Kosmidis, Leonidas; Abella, Jaume; Cazorla, Francisco J. (IEEE, 2018-01-16)
    Article
    Open Access
    As software continues to control more system-critical functions in cars, its timing is becoming an integral element in functional safety. Timing validation and verification (V&V) assesses softwares end-to-end timing ...
  • Safety-Related Challenges and Opportunities for GPUs in the Automotive Domain 

    Alcaide, Sergi; Kosmidis, Leonidas; Tabani, Hamid; Hernandez, Carles; Abella, Jaume; Cazorla, Francisco J. (IEEE, 2018-10-09)
    Article
    Open Access
    GPUs have been shown to cover the computing performance needs of autonomous driving (AD) systems. However, since the GPUs used for AD build on designs for the mainstream market, they may lack fundamental properties for ...
  • A hierarchic task-based programming model for distributed heterogeneous computing 

    Ejarque, Jorge; Dominguez, Marc; Badia, Rosa M. (SAGE Publications, 2019-05-01)
    Article
    Open Access
    Distributed computing platforms are evolving to heterogeneous ecosystems with Clusters, Grids and Clouds introducing in its computing nodes, processors with different core architectures, accelerators (i.e. GPUs, FPGAs), ...
  • Locality-aware cache random replacement policies 

    Benedicte Illescas, Pedro; Hernandez, Carles; Abella Ferrer, Jaume; Cazorla Almeida, Francisco Javier (Elsevier, 2019-02)
    Article
    Restricted access - publisher's policy
    Measurement-Based Probabilistic Timing Analysis (MBPTA) facilitates the analysis of complex software running on hardware comprising high-performance features. MBPTA also aims at preventing additional analysis costs for ...
  • Studying the impact of the Full-Network embedding on multimodal pipelines 

    Vilalta, Armand; Garcia-Gasulla, Dario; Pares, Ferran; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Moya-Sánchez, Ulises; Cortés García, Claudio Ulises (IOS Press, 2018)
    Article
    Open Access
    The current state of the art for image annotation and image retrieval tasks is obtained through deep neural network multimodal pipelines, which combine an image representation and a text representation into a shared embedding ...
  • Deep Learning is blowing in the wind. Deep models applied to wind prediction at turbine level 

    Manero, Jaume; Béjar, Javier; Cortés García, Claudio Ulises (IOP Publishing, 2019-05)
    Article
    Open Access
    Wind Energy generation depends on the existence of wind, a meteorological phenomena intermittent by nature, with the consequence of generating uncertainty on the availability of wind energy in the future. The grid stability ...
  • On the maturity of parallel applications for asymmetric multi-core processors 

    Chronaki, Kallia; Moreto Planas, Miquel; Casas, Marc; Rico, Alejandro; Badia Sala, Rosa Maria; Ayguadé Parra, Eduard; Valero Cortés, Mateo (Elsevier, 2019-05-01)
    Article
    Restricted access - publisher's policy
    Asymmetric multi-cores (AMCs) are a successful architectural solution for both mobile devices and supercomputers. By maintaining two types of cores (fast and slow) AMCs are able to provide high performance under the facility ...
  • Time-Randomized Wormhole NoCs for Critical Applications 

    Slijepcevic, Mladen; Hernandez, Carles; Abella, Jaume; Cazorla, Francisco J. (Association for Computing Machinery (ACM), 2019-02)
    Article
    Open Access
    Wormhole-based NoCs (wNoCs) are widely accepted in high-performance domains as the most appropriate solution to interconnect an increasing number of cores in the chip. However, wNoCs suitability in the context of critical ...
  • Increasing the Reliability of Software Timing Analysis for Cache-Based Processors 

    Milutinovic, Suzana; Mezzetti, Enrico; Abella, Jaume; Cazorla, Francisco J. (IEEE, 2019-06-01)
    Article
    Open Access
    Real-time systems are witnessing a significant increase in critical software's size, complexity, and performance needs, which can only be satisfied with high-performance hardware features. Cache memories, pervasively used ...

View more