Mostra el registre d'ítem simple
Towards resilient EU HPC systems: A blueprint
dc.contributor.author | Radojković, Petar |
dc.contributor.author | Marazakis, Manolis |
dc.contributor.author | Carpenter, Paul Matthew |
dc.contributor.author | Jeyapaul, Reiley |
dc.contributor.author | Gizopoulos, Dimitris |
dc.contributor.author | Schulz, Martin |
dc.contributor.author | Armejach Sanosa, Adrià |
dc.contributor.author | Ayguadé Parra, Eduard |
dc.contributor.author | Canal Corretger, Ramon |
dc.contributor.author | Moretó Planas, Miquel |
dc.contributor.author | Salami, Behzad |
dc.contributor.author | Unsal, Osman Sabri |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors |
dc.contributor.other | Universitat Politècnica de Catalunya. Doctorat en Arquitectura de Computadors |
dc.contributor.other | Barcelona Supercomputing Center |
dc.date.accessioned | 2020-10-23T08:33:21Z |
dc.date.available | 2020-10-23T08:33:21Z |
dc.date.issued | 2020-04 |
dc.identifier.citation | Radojkovic, P. [et al.]. Towards resilient EU HPC systems: A blueprint. 2020. |
dc.identifier.uri | http://hdl.handle.net/2117/330695 |
dc.description.abstract | This document aims to spearhead a Europe-wide discussion on HPC system resilience and to help the European HPC community define best practices for resilience. We analyse a wide range of state-of-the-art resilience mechanisms and recommend the most effective approaches to employ in large-scale HPC systems. Our guidelines will be useful in the allocation of available resources, as well as guiding researchers and research funding towards the enhancement of resilience approaches with the highest priority and utility. Although our work is focused on the needs of next generation HPC systems in Europe, the principles and evaluations are applicable globally. |
dc.description.sponsorship | This work has received funding from the European Union’s Horizon 2020 research and innovation programme under the projects ECOSCALE (grant agreement No 671632), EPI (grant agreement No 826647), EuroEXA (grant agreement No 754337), Eurolab4HPC (grant agreement No 800962), EVOLVE (grant agreement No 825061), EXA2PRO (grant agreement No 801015), ExaNest (grant agreement No 671553), ExaNoDe (grant agreement No 671578), EXDCI-2 (grant agreement No 800957), LEGaTO (grant agreement No 780681), MB2020 (grant agreement No 779877), RECIPE (grant agreement No 801137) and SDK4ED (grant agreement No 780572). The work was also supported by the European Commission’s Seventh Framework Programme under the projects CLERECO (grant agreement No 611404), the NCSA-Inria-ANL-BSC-JSCRiken-UTK Joint-Laboratory for Extreme Scale Computing – JLESC (https://jlesc.github.io/), OMPI-X project (No ECP-2.3.1.17) and the Spanish Government through Severo Ochoa programme (SEV-2015-0493). This work was sponsored in part by the U.S. Department of Energy's Office of Advanced Scientific Computing Research, program managers Robinson Pino and Lucy Nowell. This manuscript has been authored by UT-Battelle, LLC under Contract No DE-AC05-00OR22725 with the U.S. Department of Energy. |
dc.format.extent | 30 p. |
dc.language.iso | eng |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors |
dc.subject.lcsh | High performance computing -- Europe |
dc.title | Towards resilient EU HPC systems: A blueprint |
dc.type | External research report |
dc.subject.lemac | Càlcul intensiu (Informàtica) -- Europa |
dc.contributor.group | Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions |
dc.contributor.group | Universitat Politècnica de Catalunya. VIRTUOS - Virtualisation and Operating Systems |
dc.relation.publisherversion | https://resilienthpc.eu/ |
dc.rights.access | Open Access |
local.identifier.drac | 29519509 |
dc.description.version | Preprint |
dc.relation.projectid | info:eu-repo/grantAgreement/EC/FP7/611404/EU/Cross-Layer Early Reliability Evaluation for the Computing cOntinuum/CLERECO |
dc.relation.projectid | info:eu-repo/grantAgreement/EC/H2020/801137/EU/REliable power and time-ConstraInts-aware Predictive management of heterogeneous Exascale systems/RECIPE |
dc.relation.projectid | info:eu-repo/grantAgreement/EC/H2020/800962/EU/Consolidation of European Research Excellence in Exascale HPC Systems/EUROLAB4HPC2 |
dc.relation.projectid | info:eu-repo/grantAgreement/EC/H2020/826647/EU/SGA1 (Specific Grant Agreement 1) OF THE EUROPEAN PROCESSOR INITIATIVE (EPI)/EPI SGA1 |
dc.relation.projectid | info:eu-repo/grantAgreement/EC/H2020/671578/EU/European Exascale Processor Memory Node Design/ExaNoDe |
dc.relation.projectid | info:eu-repo/grantAgreement/EC/H2020/780681/EU/Low Energy Toolset for Heterogeneous Computing/LEGaTO |
dc.relation.projectid | info:eu-repo/grantAgreement/EC/H2020/779877/EU/Mont-Blanc 2020, European scalable, modular and power efficient HPC processor/Mont-Blanc 2020 |
local.citation.author | Radojkovic, P.; Marazakis, M.; Carpenter, P.; Jeyapaul, R.; Gizopoulos, D.; Schulz, M.; Armejach, A.; Ayguadé, E.; Canal, R.; Moreto, M.; Salami, B.; Unsal, O. |
Fitxers d'aquest items
Aquest ítem apareix a les col·leccions següents
-
Reports de recerca [58]
-
Reports de recerca [8]
-
Reports de recerca [2]
-
Reports de recerca [181]
-
Reports de recerca [15]