Recent submissions

  • Efficient CFD code implementation for the ARM-based Mont-Blanc architecture 

    Oyarzun, G.; Borrell, Ricard; Gorobets, A.; Mantovani, Filippo; Oliva, A. (Elsevier, 2018-02)
    Article
    Open access
    Since 2011, the European project Mont-Blanc has been focused on enabling ARM-based technology for HPC, developing both hardware platforms and system software. The latest Mont-Blanc prototypes use system-on-chip (SoC) devices ...
  • Improving the integration of task nesting and dependencies in OpenMP 

    Pérez, Josep M.; Beltran, Vicenç; Labarta, Jesús; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    Conference report
    Open access
    The tasking model of OpenMP 4.0 supports both nesting and the definition of dependences between sibling tasks. A natural way to parallelize many codes with tasks is to first taskify the high-level functions and then to ...
  • Picos, a hardware task-dependence manager for task-based dataflow programming models 

    Tan, Xubin; Bosch, Jaume; Vidal, Miquel; Álvarez, Carlos; Jiménez-González, Daniel; Ayguadé Parra, Eduard; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    Conference report
    Open access
    Task-based programming models such as OpenMP, Intel TBB and OmpSs are widely used to extract high level of parallelism of applications executed on multi-core and manycore platforms. These programming ...
  • HetFS: A heterogeneous file system for everyone 

    Koloventzos, Georgios; Nou, Ramon; Miranda, Alberto; Cortés, Toni (Springer, 2017)
    Conference report
    Open access
    Storage devices have been getting more and more diverse during the last decade. The advent of SSDs made it painfully clear that rotating devices, such as HDDs or magnetic tapes, were lacking in regards to response time. ...
  • Energy Efficient Ethernet on MapReduce Clusters: Packet Coalescing To Improve 10GbE Links 

    Fischer e Silva, Renan; Carpenter, Paul M. (IEEE, 2017-10)
    Article
    Open access
    An important challenge of modern data centers is to reduce energy consumption, of which a substantial proportion is due to the network. Switches and NICs supporting the recent energy efficient Ethernet (EEE) standard are ...
  • Integrating memory perspective into the BSC performance tools 

    Servat, Harald; Labarta Mancho, Jesús José; Hoppe, Hans-Christian; Gimenez, Judit; Peña, Antonio J. (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    Conference report
    Open access
    The growing gap between processor and memory speeds results in complex memory hierarchies as processors evolve to mitigate such differences by taking advantage of locality of reference. In this direction, the BSC performance ...
  • Exploiting key-value data stores scalability for HPC 

    Cugnasco, Cesare; Becerra Fontal, Yolanda; Torres Viñals, Jordi; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    Conference report
    Open access
    BigData revolutionised the IT industry. It first interested the OLTP systems. Distributed Hash Tables replaced Traditional SQL databases as they guaranteed low response time on simple read/write requests. The second wave ...
  • Code modernization strategies to 3-D Stencil-based applications on Intel Xeon Phi: KNC and KNL 

    Cebrián, Juan M.; Cecilia, José M.; Hernández, Mario; García, José M. (Elsevier, 2017-11-15)
    Article
    Restricted access due to publisher policy
    Partial Differential Equations (PDEs) are widely used to simulate many scenarios in science and engineering, usually solved through iterative techniques (e.g., Jacobi, Gauss–Seidel). These methods produce an approximate ...
  • Beyond the socket: NUMA-aware GPUs 

    Ugljesa, Milic; Villa, Oreste; Bolotin, Evgeny; Arunkumar, Akhil; Ebrahimi, Eiman; Jaleel, Aamer; Ramirez, Alex; Nellans, David (Association for Computing Machinery, 2017-10)
    Conference lecture
    Open access
    GPUs achieve high throughput and power efficiency by employing many small single instruction multiple thread (SIMT) cores. To minimize scheduling logic and performance variance they utilize a uniform memory system and ...
  • Is Arm software ecosystem ready for HPC? 

    Banchelli Gracia, Fabio F.; Ruiz, Daniel; Hao Xu Lin, Ying; Mantovani, Filippo (2017-11-14)
    Conference lecture
    Open access
    In recent years, the HPC community has increasingly grown its interest towards the Arm architecture with research projects targeting primarily the installation of Arm-based clusters. State of the art research project ...
  • Enabling a reliable STT-MRAM main memory simulation 

    Asifuzzaman, Kazi; Sánchez-Verdejo, Rommel; Radojković, Petar (Association for Computing Machinery, 2017-10)
    Conference lecture
    Open access
    STT-MRAM is a promising new memory technology with very desirable set of properties such as non-volatility, byte-addressability and high endurance. It has the potential to become the universal memory that could be incorporated ...
  • PaaS-IaaS inter-layer adaptation in an energy-aware cloud environment 

    Djemame, Karim; Bosch, Raimon; Kavanagh, Richard; Alvarez, Pol; Ejarque, Jorge; Guitart Fernández, Jordi; Blasi, Lorenzo (Institute of Electrical and Electronics Engineers (IEEE), 2017-06)
    Article
    Open access
    Cloud computing providers resort to a variety of techniques to improve energy consumption at each level of the cloud computing stack. Most of these techniques consider resource-level energy optimization at IaaS layer. This ...
