Now showing items 41-60 of 274

    • Automatic multilevel parallelization using OpenMP 

      Jin, H; Jost, G; Yan, J; Ayguadé Parra, Eduard; González Tallada, Marc; Martorell Bofill, Xavier (2004-06)
      Article
      Restricted access (publisher's policy)
      In this paper we describe the extension of the CAPO parallelization support tool to support multilevel parallelism based on OpenMP directives. CAPO generates OpenMP directives with extensions supported by the NanosCompiler ...
    • Automatized fire room 

      Higa, Juan Diego (Universitat Politècnica de Catalunya, 2018-06)
      Bachelor's thesis
      Restricted access (confidentiality agreement)
    • AutoParallel: Automatic parallelisation and distributed execution of affine loop nests in Python 

      Ramón Cortés, Cristian; Amela Milian, Ramon; Ejarque Artigas, Jorge; Clauss, Philippe; Badia Sala, Rosa Maria (Sage, 2020)
      Article
      Open access
      The last improvements in programming languages and models have focused on simplicity and abstraction; leading Python to the top of the list of the programming languages. However, there is still room for improvement when ...
    • Big Data technologies for High Performance Computing 

      Martínez Blanco, Miquel (Universitat Politècnica de Catalunya, 2020-06)
      Bachelor's thesis
      Open access
      Hecuba is a tool written in Python and C++ developed in the Barcelona Supercomputing Center (BSC), it allows to simplify the process of reading and writing in Cassandra databases. The objective is to integrate Hecuba into ...
    • Breaking master-slave model between host and FPGAs 

      Bosch Pons, Jaume; Vidal, Miquel; Filgueras Izquierdo, Antonio; Álvarez Martínez, Carlos; Jiménez González, Daniel; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2020)
      Conference lecture
      Open access
      This paper proposes to enhance current task-based programming models by breaking their current master-slave approach between the main processor and its hardware accelerators. As a proof-of-concept, it presents an extension ...
    • Clock gate on abort: Towards energy-efficient hardware transactional memory 

      Sanyal, Sutirtha; Roy, Sourav; Cristal Kestelman, Adrián; Unsal, Osman Sabri; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2009)
      Conference report
      Open access
      Transactional Memory (TM) is an emerging technology which promises to make parallel programming easier compared to earlier lock based approaches. However, as with any form of speculation, Transactional Memory too wastes a ...
    • Coarse grain parallelization of deep neural networks 

      González Tallada, Marc (Institute of Electrical and Electronics Engineers (IEEE), 2016)
      Conference lecture
      Restricted access (publisher's policy)
      Deep neural networks (DNN) have recently achieved extraordinary results in domains like computer vision and speech recognition. An essential element for this success has been the introduction of high performance computing ...
    • Code generation for the dataflow-based XSMLL runtime 

      Peyrolón Lago, Daniel (Universitat Politècnica de Catalunya, 2017-10)
      Master's thesis
      Open access
      Carried out at/with:   Barcelona Supercomputing Center
      This document presents the report of the Master’s Thesis Code generation for the dataflow-based xsmll runtime developed in the context of the Master in Innovation and Research in Informatics. In this report we analyze ...
    • Com programar en X10 

      Dominguez Peñacoba, Jorge (Universitat Politècnica de Catalunya, 2008-01-15)
      Final degree project (pre-Bologna)
      Open access
      A study of the X10 language: a reference manual for the language, beginning with a brief description of the language and its features, and, in addition, research in the field of technology ...
    • Combining one-sided communications with task-based programming models 

      Sala Penadés, Kevin; Macià Sorrosal, Sandra; Beltran Querol, Vicenç (Institute of Electrical and Electronics Engineers (IEEE), 2021)
      Conference report
      Open access
      Hybrid programming combining task-based and message-passing models is an increasingly popular technique to exploit multi-core clusters. The Task-Aware MPI (TAMPI) library integrates both models enabling the safe overlap ...
    • Combining two formal methods of the static analyses 

      Honorat Poblette, Jorge Luis (Universitat Politècnica de Catalunya, 2012-06-19)
      Master's thesis
      Open access
      [ENGLISH] Given the background in software and hardware evolution over the years as well as the demand of accurate information in different industrial areas as aeronautics, nuclear and medical is how this Master thesis born. ...
    • COMP Superscalar, an interoperable programming framework 

      Badia Sala, Rosa Maria; Conejero, Javier; Díaz, Carlos; Ejarque, Jorge; Lezzi, Daniele; Lordan Gomis, Francesc-Josep; Ramón Cortés, Cristian; Sirvent Pardell, Raül (2015-12-01)
      Article
      Open access
      COMPSs is a programming framework that aims to facilitate the parallelization of existing applications written in Java, C/C++ and Python scripts. For that purpose, it offers a simple programming model based on sequential ...
    • Comparing MapReduce and pipeline implementations for counting triangles 

      Pasarella Sánchez, Ana Edelmira; Vidal Serodio, Maria Esther; Zoltan, Cristina (2016)
      Conference report
      Open access
      A generalized method to define the Divide & Conquer paradigm in order to have processors acting on its own data and scheduled in a parallel fashion. MapReduce is a programming model that follows this paradigm, and allows ...
    • Comparing MapReduce and pipeline implementations for counting triangles 

      Pasarella Sánchez, Ana Edelmira; Vidal, Maria-Esther; Zoltan Torres, Ana Cristina (2017-01-11)
      Article
      Open access
      A common method to define a parallel solution for a computational problem consists in finding a way to use the Divide and Conquer paradigm in order to have processors acting on its own data and scheduled in a parallel ...
    • Compiler automatic discovery of OmpSs task dependencies 

      Royuela, Sara; Duran González, Alejandro; Martorell Bofill, Xavier (Springer, 2013)
      Conference report
      Restricted access (publisher's policy)
      Dependence analysis is an essential step for many compiler optimizations, from simple loop transformations to automatic parallelization. Parallel programming models require specific dependence analyses that take into account ...
    • Complex pipelined executions in OpenMP parallel applications 

      González Tallada, Marc; Ayguadé Parra, Eduard; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2001)
      Conference report
      Open access
      This paper proposes a set of extensions to the OpenMP programming model to express complex pipelined computations. This is accomplished by defining, in the form of directives, precedence relations among the tasks originated ...
    • Computació paral.lela en iOS per experts en MPI 

      Català i Montaner, Marc (Universitat Politècnica de Catalunya, 2019-10)
      Bachelor's thesis
      Open access
      Every day new applications related to the world of technology appear. Naturally, many fields have been the subject of a large number of studies, while others go more unnoticed. ...
    • Computational Fluid and Particle Dynamics Simulations for Respiratory System: Runtime Optimization on an Arm Cluster 

      Garcia-Gasulla, Marta; Josep-Fabrego, Marc; Eguzkitza, Beatriz; Mantovani, Filippo (Association for Computing Machinery (ACM), 2018-08-13)
      Conference lecture
      Open access
      Computational fluid and particle dynamics simulations (CFPD) are of paramount importance for studying and improving drug effectiveness. Computational requirements of CFPD codes involves high-performance computing (HPC) ...
    • Computational improvements of microaggregation algorithms for the anonymization of large-scale datasets 

      García Álvarez, Alejandro (Universitat Politècnica de Catalunya, 2017-01)
      Bachelor's thesis
      Open access
      The technical contents of this work fall within the field of statistical disclosure control (SDC), which concerns the postprocessing of the demographic portion of the statistical results of surveys containing sensitive ...
    • Compute units in OpenMP: extensions for heterogeneous parallel programming 

      González Tallada, Marc; Morancho Llena, Enrique (John Wiley & sons, 2024-01-10)
      Article
      Open access
      This article evaluates the current support for heterogeneous OpenMP 5.2 applications regarding the simultaneous activation of host and device computing units (e.g., CPUs, GPUs, or FPGAs). The article identifies limitations ...