Enviaments recents

  • Dynamic Adaptable Asynchronous Progress Model for MPI RMA Multiphase Applications 

    Si, Min; Peña, Antonio J.; Hammond, Jeff; Balaji, Pavan; Takagi, Masamichi; Ishikawa, Yutaka (IEEE, 2018-09-01)
    Article
    Accés obert
    Casper is a process-based asynchronous progress model for MPI one-sided communication on multi- and many-core architectures. The one-sided communication is not truly one-sided in most MPI implementations: the target process ...
  • An approach to task-based parallel programming for undergraduate students 

    Ayguadé Parra, Eduard; Jiménez González, Daniel (2018-03-07)
    Article
    Accés obert
    This paper presents the description of a compulsory parallel programming course in the bachelor degree in Informatics Engineering at the Barcelona School of Informatics, Universitat Politècnica de Catalunya UPC-BarcelonaTech. ...
  • Impact on Network Performance of Probe Vehicle Data Usage: An Experimental Design for Simulation Assessment 

    Montero, Lídia; Linares, Maria Paz; Casanovas, Josep; Codina, Esteve; Recio, Gonzalo; Lorente, Ester; Salmerón, Juan (Hindawi Publishing Corporation, 2018-06-25)
    Article
    Accés obert
    Probe-based technologies are proliferating as a means of inferring traffic states. Technological companies are interested in traffic data for computing the best routes in a traffic-aware manner and they also provide real-time ...
  • A resilient and distributed near real-time traffic forecasting application for Fog computing environments 

    Pérez, Juan L.; Gutierrez-Torre, Alberto; Berral, Josep Ll.; Carrera, David (Elsevier, 2018-10)
    Article
    Accés obert
    In this paper we propose an architecture for a city-wide traffic modeling and prediction service based on the Fog Computing paradigm. The work assumes an scenario in which a number of distributed antennas receive data ...
  • Understanding memory access patterns using the BSC performance tools 

    Servat, Harald; Labarta, Jesús; Hoppe, Hans-Christian; Giménez, Judit; Peña, Antonio J. (Elsevier, 2018-10)
    Article
    Accés restringit per política de l'editorial
    The growing gap between processor and memory speeds has lead to complex memory hierarchies as processors evolve to mitigate such divergence by exploiting the locality of reference. In this direction, the BSC performance ...
  • Resilient gossip-inspired all-reduce algorithms for high-performance computing - Potential, limitations, and open questions 

    Casas, Marc; Gansterer, Wilfried N.; Wimmer, Elias (SAGE Publications, 2018-04-09)
    Article
    Accés obert
    We investigate the usefulness of gossip-based reduction algorithms in a high-performance computing (HPC) context. We compare them to state-of-the-art deterministic parallel reduction algorithms in terms of fault tolerance ...
  • Performance and Power Analysis of HPC Workloads on Heterogenous Multi-Node Clusters 

    Mantovani, Filippo; Calore, Enrico (MDPI, 2018-05-04)
    Article
    Accés obert
    Performance analysis tools allow application developers to identify and characterize the inefficiencies that cause performance degradation in their codes, allowing for application optimizations. Due to the increasing ...
  • Reducing cache coherence traffic with a NUMA-aware runtime approach 

    Caheny, Paul; Alvarez, Lluc; Derradji, Said; Valero Cortés, Mateo; Moreto Planas, Miquel; Casas Guix, Marc (2018-05)
    Article
    Accés obert
    Cache Coherent NUMA (ccNUMA) architectures are a widespread paradigm due to the benefits they provide for scaling core count and memory capacity. Also, the flat memory address space they offer considerably improves ...
  • On the tailoring of CAST-32A certification guidance to real COTS multicore architectures 

    Agirre, Irune; Abella, Jaume; Azkarate-Askasua, Mikel; Cazorla, Francisco J. (IEEE, 2018-03-12)
    Comunicació de congrés
    Accés obert
    The use of Commercial Off-The-Shelf (COTS) multicores in real-time industry is on the rise due to multicores' potential performance increase and energy reduction. Yet, the unpredictable impact on timing of contention in ...
  • Reconciling Time Predictability and Performance in Future Computing Systems 

    Cazorla, Francisco J.; Abella, Jaume; Mezzetti, Enrico; Hernandez, Carles; Vardanega, Tullio; Bernat, Guillem (IEEE, 2018-04)
    Article
    Accés obert
    MBTA studies the system’s timing in analysis scenarios, to determine upper bounds to the worst-case execution-time behavior that may occur at operation. MBTA’s challenge is to construct analysis-time scenarios that help ...
  • Automatic Generation of Workload Profiles Using Unsupervised Learning Pipelines 

    Buchaca Prats, David; Berral, Josep Ll.; Carrera, David (IEEE, 2018-03)
    Article
    Accés obert
    The complexity of resource usage and power consumption on cloud-based applications makes the understanding of application behavior through expert examination difficult. The difficulty increases when applications are seen ...
  • Execution time distributions in embedded safety-critical systems using extreme value theory 

    del Castillo, Joan; Padilla, Maria; Abella, Jaume; Cazorla, Francisco J. (Inderscience, 2017)
    Article
    Accés restringit per política de l'editorial
    Several techniques have been proposed to upper-bound the worst-case execution time behaviour of programs in the domain of critical real-time embedded systems. These computing systems have strong requirements regarding the ...

Mostra'n més