Enviaments recents

  • PerfBound: Conserving Energy with Bounded Overheads in On/Off-Based HPC Interconnects 

    Saravanan, Karthikeyan P.; Carpenter, Paul M. (IEEE, 2018-07-01)
    Article
    Accés obert
    Energy and power are key challenges in high-performance computing. System energy efficiency must be significantly improved, and this requires greater efficiency in all subcomponents. An important target of optimization is ...
  • Executing linear algebra kernels in heterogeneous distributed infrastructures with PyCOMPSs 

    Amela, Ramon; Ramon-Cortes, Cristian; Ejarque, Jorge; Conejero, Javier; Badia, Rosa M. (EDP Open, 2018-10-24)
    Article
    Accés obert
    Python is a popular programming language due to the simplicity of its syntax, while still achieving a good performance even being an interpreted language. The adoption from multiple scientific communities has evolved in ...
  • Fitting Software Execution-Time Exceedance into a Residual Random Fault in ISO-26262 

    Agirre, Irune; Cazorla, Francisco J.; Abella, Jaume; Hernandez, Carles; Mezzetti, Enrico; Azkarate-askasua, Mikel; Vardanega, Tullio (IEEE, 2018-09-01)
    Accés obert
    Car manufacturers relentlessly replace or augment the functionality of mechanical subsystems with electronic components. Most such subsystems (e.g., steer-by-wire) are safety related, hence, subject to regulation. ISO-26262, ...
  • Dynamic Adaptable Asynchronous Progress Model for MPI RMA Multiphase Applications 

    Si, Min; Peña, Antonio J.; Hammond, Jeff; Balaji, Pavan; Takagi, Masamichi; Ishikawa, Yutaka (IEEE, 2018-09-01)
    Article
    Accés obert
    Casper is a process-based asynchronous progress model for MPI one-sided communication on multi- and many-core architectures. The one-sided communication is not truly one-sided in most MPI implementations: the target process ...
  • An approach to task-based parallel programming for undergraduate students 

    Ayguadé Parra, Eduard; Jiménez González, Daniel (2018-03-07)
    Article
    Accés obert
    This paper presents the description of a compulsory parallel programming course in the bachelor degree in Informatics Engineering at the Barcelona School of Informatics, Universitat Politècnica de Catalunya UPC-BarcelonaTech. ...
  • Impact on Network Performance of Probe Vehicle Data Usage: An Experimental Design for Simulation Assessment 

    Montero, Lídia; Linares, Maria Paz; Casanovas, Josep; Codina, Esteve; Recio, Gonzalo; Lorente, Ester; Salmerón, Juan (Hindawi Publishing Corporation, 2018-06-25)
    Article
    Accés obert
    Probe-based technologies are proliferating as a means of inferring traffic states. Technological companies are interested in traffic data for computing the best routes in a traffic-aware manner and they also provide real-time ...
  • A resilient and distributed near real-time traffic forecasting application for Fog computing environments 

    Pérez, Juan L.; Gutierrez-Torre, Alberto; Berral, Josep Ll.; Carrera, David (Elsevier, 2018-10)
    Article
    Accés obert
    In this paper we propose an architecture for a city-wide traffic modeling and prediction service based on the Fog Computing paradigm. The work assumes an scenario in which a number of distributed antennas receive data ...
  • Understanding memory access patterns using the BSC performance tools 

    Servat, Harald; Labarta, Jesús; Hoppe, Hans-Christian; Giménez, Judit; Peña, Antonio J. (Elsevier, 2018-10)
    Article
    Accés restringit per política de l'editorial
    The growing gap between processor and memory speeds has lead to complex memory hierarchies as processors evolve to mitigate such divergence by exploiting the locality of reference. In this direction, the BSC performance ...
  • Resilient gossip-inspired all-reduce algorithms for high-performance computing - Potential, limitations, and open questions 

    Casas, Marc; Gansterer, Wilfried N.; Wimmer, Elias (SAGE Publications, 2018-04-09)
    Article
    Accés obert
    We investigate the usefulness of gossip-based reduction algorithms in a high-performance computing (HPC) context. We compare them to state-of-the-art deterministic parallel reduction algorithms in terms of fault tolerance ...
  • Asynchronous and exact forward recovery for detected errors in iterative solvers 

    Jaulmes, Luc Etienne; Casas, Marc; Moreto Planas, Miquel; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Valero Cortés, Mateo (2018-03-19)
    Article
    Accés obert
    Current trends and projections show that faults in computer systems become increasingly common. Such errors may be detected, and possibly corrected transparently, e.g. by Error Correcting Codes (ECC). For a program to be ...
  • A general guide to applying machine learning to computer architecture 

    Nemirovsky, Daniel; Arkose, Tugberk; Markovic, Nikola; Nemirovsky, Mario; Unsal, Osman Sabri; Cristal Kestelman, Adrián; Valero Cortés, Mateo (2018)
    Article
    Accés obert
    The resurgence of machine learning since the late 1990s has been enabled by significant advances in computing performance and the growth of big data. The ability of these algorithms to detect complex patterns in data which ...
  • Performance and Power Analysis of HPC Workloads on Heterogenous Multi-Node Clusters 

    Mantovani, Filippo; Calore, Enrico (MDPI, 2018-05-04)
    Article
    Accés obert
    Performance analysis tools allow application developers to identify and characterize the inefficiencies that cause performance degradation in their codes, allowing for application optimizations. Due to the increasing ...

Mostra'n més