Ara es mostren els items 21-40 de 274

    • Advanced synchronization techniques for task-based runtime systems 

      Álvarez Robert, David; Sala Penadés, Kevin; Maroñas Bravo, Marcos; Roca Nonell, Aleix; Beltran Querol, Vicenç (Association for Computing Machinery (ACM), 2021)
      Text en actes de congrés
      Accés obert
      Task-based programming models like OmpSs-2 and OpenMP provide a flexible data-flow execution model to exploit dynamic, irregular and nested parallelism. Providing an efficient implementation that scales well with small ...
    • Algoritmo genético para la programación multicriterio de piezas en máquinas paralelas 

      Rodriguez Urbina, Daniel (Universitat Politècnica de Catalunya, 2009-06)
      Projecte/Treball Final de Carrera
      Accés obert
      Este proyecto tiene como objetivo desarrollar un procedimiento de programación de piezas en un entorno productivo formado por máquinas idénticas en paralelo, considerando una función objetivo multicriterio. En el problema ...
    • Align and distribute-based linear loop transformations 

      Torres Viñals, Jordi; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Valero Cortés, Mateo (Springer, 1993)
      Text en actes de congrés
      Accés obert
      In this paper we generalize the framework of linear loop transformations in the sense that loop alignment is considered as a new component in the transformation process. The aim is to match the structure of loop nests with ...
    • Alya: Multiphysics engineering simulation toward exascale 

      Vázquez, Mariano; Houzeaux, Guillaume; Koric, Seid; Artigues, Antoni; Aguado Sierra, Jazmin; Arís Sánchez, Ruth; Mira Martínez, Daniel; Calmet, Hadrien; Cucchietti, Fernando; Owen, Herbert; Taha, Ahmed; Dering Burness, Evan; Cela Espín, José M.; Valero Cortés, Mateo (Elsevier, 2016-05)
      Article
      Accés obert
      Alya is a multi-physics simulation code developed at Barcelona Supercomputing Center (BSC). From its inception Alya code is designed using advanced High Performance Computing programming techniques to solve coupled problems ...
    • An Analysis of Lazy and Eager Limited Preemption Approaches under DAG-Based Global Fixed Priority Scheduling 

      Serrano, Maria A.; Melani, Alessandra; Kehr, Sebastian; Bertogna, Marko; Quiñones, Eduardo (Institute of Electrical and Electronics Engineers (IEEE), 2017-07-03)
      Comunicació de congrés
      Accés obert
      DAG-based scheduling models have been shown to effectively express the parallel execution of current many-core heterogeneous architectures. However, their applicability to real-time settings is limited by the difficulties ...
    • An approach to task-based parallel programming for undergraduate students 

      Ayguadé Parra, Eduard; Jiménez González, Daniel (2018-03-07)
      Article
      Accés obert
      This paper presents the description of a compulsory parallel programming course in the bachelor degree in Informatics Engineering at the Barcelona School of Informatics, Universitat Politècnica de Catalunya UPC-BarcelonaTech. ...
    • An architecture for resource management in a fog-to-cloud framework 

      Sengupta, Souvik; García Almiñana, Jordi; Masip Bruin, Xavier (Springer, 2018)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Fog-to-cloud (F2C) platforms provide an excellent framework for the efficient resource management in the context of smart cities. In such a scenario, a vast number of heterogeneous resources, including computing devices ...
    • An integrated vector-scalar design on an in-order ARM core 

      Stanic, Milan; Palomar Pérez, Óscar; Hayes, Timothy; Ratkovic, Ivan; Cristal Kestelman, Adrián; Unsal, Osman Sabri; Valero Cortés, Mateo (2017-07)
      Article
      Accés obert
      In the low-end mobile processor market, power, energy, and area budgets are significantly lower than in the server/desktop/laptop/high-end mobile markets. It has been shown that vector processors are a highly energy-efficient ...
    • An unified parallel C compiler that implements automatic communication aggregation 

      Barton, Christopher; Almási, George; Farreras Esclusa, Montserrat; Amaral, José Nelson (2009)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Partitioned Global Address Space (PGAS) programming languages, such as Unified Parallel C (UPC), offer an attractive high-productivity programming model for programming large-scale parallel machines. PGAS languages partition ...
    • Analyzing the efficiency of hybrid codes 

      Giménez Lucas, Judit; Mercadal, Estanislao; Llort, German; Méndez, Sandra (Institute of Electrical and Electronics Engineers (IEEE), 2020)
      Text en actes de congrés
      Accés obert
      Hybrid parallelization may be the only path for most codes to use HPC systems on a very large scale. Even within a small scale, with an increasing number of cores per node, combining MPI with some shared memory thread-based ...
    • Anàlisi i integració de models de programació paral·lels en SoC Tegra 2 

      Prat Robles, David (Universitat Politècnica de Catalunya, 2011-01-19)
      Projecte/Treball Final de Carrera
      Accés obert
      Anàlisi i integració de models de programació paral.lels en SoC Tegra 2 Instal·lació Linux i verificació del funcionament Benchmarks amb altres processadors Escalabilitat de Pthreads VS OpenMP Muntatge del clúster amb MPI ...
    • Applying interposition techniques for performance analysis of OPENMP parallel applications 

      González Tallada, Marc; Serra, Albert; Martorell Bofill, Xavier; Oliver Segura, José; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Navarro, Nacho (Institute of Electrical and Electronics Engineers (IEEE), 2000)
      Text en actes de congrés
      Accés obert
      Tuning parallel applications requires the use of effective tools for detecting performance bottlenecks. Along a parallel program execution, many individual situations of performance degradation may arise. We believe that ...
    • Assembling a high-productivity DSL for computational fluid dynamics 

      Macià, Sandra; Martínez-Ferrer, Pedro J.; Mateo, Sergi; Beltran Querol, Vicenç; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2019)
      Text en actes de congrés
      Accés obert
      As we move towards exascale computing, an abstraction for effective parallel computation is increasingly needed to overcome the maintainability and portability of scientific applications while ensuring the efficient and ...
    • Asynchronous runtime for task-based dataflow programming models 

      Bosch Pons, Jaume (Universitat Politècnica de Catalunya, 2017-07)
      Projecte Final de Màster Oficial
      Accés obert
      Realitzat a/amb:   Barcelona Supercomputing Center
      The importance of parallel programming is increasing year after year since the power wall popularized multi-core processors, and with them, shared memory parallel programming models. In particular, task-based programming ...
    • Asynchronous runtime with distributed manager for task-based programming models 

      Bosch Pons, Jaume; Álvarez Martínez, Carlos; Jiménez González, Daniel; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (2020-09)
      Article
      Accés obert
      Parallel task-based programming models, like OpenMP, allow application developers to easily create a parallel version of their sequential codes. The standard OpenMP 4.0 introduced the possibility of describing a set of ...
    • Atomic quake: using transactional memory in an interactive mulitplayer game Server 

      Zyulkyarov, Ferad; Gajinov, Vladimir; Unsal, Osman Sabri; Cristal Kestelman, Adrián; Ayguadé Parra, Eduard; Harris, Tim; Valero Cortés, Mateo (2009)
      Text en actes de congrés
      Accés obert
      Transactional Memory (TM) is being studied widely as a new technique for synchronizing concurrent accesses to shared memory data structures for use in multi-core systems. Much of the initial work on TM has been evaluated ...
    • Attitude Determination System for a Phased Array Beamformer 

      Garrucho Moras, Lidia (Universitat Politècnica de Catalunya, 2015-02)
      Treball Final de Grau
      Accés restringit per acord de confidencialitat
    • Auto-tuned OpenCL kernel co-execution in OmpSs for heterogeneous systems 

      Pérez, Borja; Stafford, Esteban; Bosque Orero, José Luis; Beivide Palacio, Ramon; Mateo Bellido, Sergi; Teruel García, Xavier; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (2019-03-01)
      Article
      Accés obert
      The emergence of heterogeneous systems has been very notable recently. The nodes of the most powerful computers integrate several compute accelerators, like GPUs. Profiting from such node configurations is not a trivial ...
    • Automatic communication coalescing for irregular computations in UPC language 

      Alvanos, Michail; Tiotto, Ettore; Farreras Esclusa, Montserrat; Martorell Bofill, Xavier (IBM, 2012)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Partitioned Global Address Space (PGAS) languages appeared to address programmer productivity in large scale parallel machines. However, fine grain accesses on shared structures have been identified as one of the main ...
    • Automatic detection of parallel applications computation phases 

      González García, Juan; Giménez Lucas, Judit; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Analyzing parallel programs has become increasingly difficult due to the immense amount of information collected on large systems. The use of clustering techniques has been proposed to analyze applications. However, while ...