Exploració per tema "Parallel programming (Computer science)"
Ara es mostren els items 21-40 de 274
-
Advanced synchronization techniques for task-based runtime systems
(Association for Computing Machinery (ACM), 2021)
Text en actes de congrés
Accés obertTask-based programming models like OmpSs-2 and OpenMP provide a flexible data-flow execution model to exploit dynamic, irregular and nested parallelism. Providing an efficient implementation that scales well with small ... -
Algoritmo genético para la programación multicriterio de piezas en máquinas paralelas
(Universitat Politècnica de Catalunya, 2009-06)
Projecte/Treball Final de Carrera
Accés obertEste proyecto tiene como objetivo desarrollar un procedimiento de programación de piezas en un entorno productivo formado por máquinas idénticas en paralelo, considerando una función objetivo multicriterio. En el problema ... -
Align and distribute-based linear loop transformations
(Springer, 1993)
Text en actes de congrés
Accés obertIn this paper we generalize the framework of linear loop transformations in the sense that loop alignment is considered as a new component in the transformation process. The aim is to match the structure of loop nests with ... -
Alya: Multiphysics engineering simulation toward exascale
(Elsevier, 2016-05)
Article
Accés obertAlya is a multi-physics simulation code developed at Barcelona Supercomputing Center (BSC). From its inception Alya code is designed using advanced High Performance Computing programming techniques to solve coupled problems ... -
An Analysis of Lazy and Eager Limited Preemption Approaches under DAG-Based Global Fixed Priority Scheduling
(Institute of Electrical and Electronics Engineers (IEEE), 2017-07-03)
Comunicació de congrés
Accés obertDAG-based scheduling models have been shown to effectively express the parallel execution of current many-core heterogeneous architectures. However, their applicability to real-time settings is limited by the difficulties ... -
An approach to task-based parallel programming for undergraduate students
(2018-03-07)
Article
Accés obertThis paper presents the description of a compulsory parallel programming course in the bachelor degree in Informatics Engineering at the Barcelona School of Informatics, Universitat Politècnica de Catalunya UPC-BarcelonaTech. ... -
An architecture for resource management in a fog-to-cloud framework
(Springer, 2018)
Text en actes de congrés
Accés restringit per política de l'editorialFog-to-cloud (F2C) platforms provide an excellent framework for the efficient resource management in the context of smart cities. In such a scenario, a vast number of heterogeneous resources, including computing devices ... -
An integrated vector-scalar design on an in-order ARM core
(2017-07)
Article
Accés obertIn the low-end mobile processor market, power, energy, and area budgets are significantly lower than in the server/desktop/laptop/high-end mobile markets. It has been shown that vector processors are a highly energy-efficient ... -
An unified parallel C compiler that implements automatic communication aggregation
(2009)
Text en actes de congrés
Accés restringit per política de l'editorialPartitioned Global Address Space (PGAS) programming languages, such as Unified Parallel C (UPC), offer an attractive high-productivity programming model for programming large-scale parallel machines. PGAS languages partition ... -
Analyzing the efficiency of hybrid codes
(Institute of Electrical and Electronics Engineers (IEEE), 2020)
Text en actes de congrés
Accés obertHybrid parallelization may be the only path for most codes to use HPC systems on a very large scale. Even within a small scale, with an increasing number of cores per node, combining MPI with some shared memory thread-based ... -
Anàlisi i integració de models de programació paral·lels en SoC Tegra 2
(Universitat Politècnica de Catalunya, 2011-01-19)
Projecte/Treball Final de Carrera
Accés obertAnàlisi i integració de models de programació paral.lels en SoC Tegra 2 Instal·lació Linux i verificació del funcionament Benchmarks amb altres processadors Escalabilitat de Pthreads VS OpenMP Muntatge del clúster amb MPI ... -
Applying interposition techniques for performance analysis of OPENMP parallel applications
(Institute of Electrical and Electronics Engineers (IEEE), 2000)
Text en actes de congrés
Accés obertTuning parallel applications requires the use of effective tools for detecting performance bottlenecks. Along a parallel program execution, many individual situations of performance degradation may arise. We believe that ... -
Assembling a high-productivity DSL for computational fluid dynamics
(Association for Computing Machinery (ACM), 2019)
Text en actes de congrés
Accés obertAs we move towards exascale computing, an abstraction for effective parallel computation is increasingly needed to overcome the maintainability and portability of scientific applications while ensuring the efficient and ... -
Asynchronous runtime for task-based dataflow programming models
(Universitat Politècnica de Catalunya, 2017-07)
Projecte Final de Màster Oficial
Accés obert
Realitzat a/amb: Barcelona Supercomputing CenterThe importance of parallel programming is increasing year after year since the power wall popularized multi-core processors, and with them, shared memory parallel programming models. In particular, task-based programming ... -
Asynchronous runtime with distributed manager for task-based programming models
(2020-09)
Article
Accés obertParallel task-based programming models, like OpenMP, allow application developers to easily create a parallel version of their sequential codes. The standard OpenMP 4.0 introduced the possibility of describing a set of ... -
Atomic quake: using transactional memory in an interactive mulitplayer game Server
(2009)
Text en actes de congrés
Accés obertTransactional Memory (TM) is being studied widely as a new technique for synchronizing concurrent accesses to shared memory data structures for use in multi-core systems. Much of the initial work on TM has been evaluated ... -
Attitude Determination System for a Phased Array Beamformer
(Universitat Politècnica de Catalunya, 2015-02)
Treball Final de Grau
Accés restringit per acord de confidencialitat -
Auto-tuned OpenCL kernel co-execution in OmpSs for heterogeneous systems
(2019-03-01)
Article
Accés obertThe emergence of heterogeneous systems has been very notable recently. The nodes of the most powerful computers integrate several compute accelerators, like GPUs. Profiting from such node configurations is not a trivial ... -
Automatic communication coalescing for irregular computations in UPC language
(IBM, 2012)
Text en actes de congrés
Accés restringit per política de l'editorialPartitioned Global Address Space (PGAS) languages appeared to address programmer productivity in large scale parallel machines. However, fine grain accesses on shared structures have been identified as one of the main ... -
Automatic detection of parallel applications computation phases
(Institute of Electrical and Electronics Engineers (IEEE), 2014)
Text en actes de congrés
Accés restringit per política de l'editorialAnalyzing parallel programs has become increasingly difficult due to the immense amount of information collected on large systems. The use of clustering techniques has been proposed to analyze applications. However, while ...