Exploració per tema "Parallel programming (Computer science)"
Ara es mostren els items 1-20 de 273
-
A bi-objective parallel machine problem with eligibility, release dates and delivery times of the jobs
(Taylor & Francis, 2017-08-01)
Article
Accés obertThe scheduling of parallel machines is a well-known problem in many companies. Nevertheless, not always all the jobs can be manufactured in any machine and the eligibility appears. Based on a real-life problem, we present ... -
A case for user-level dynamic page migration
(Association for Computing Machinery (ACM), 2000)
Text en actes de congrés
Accés obertThis paper presents user-level dynamic page migration, a runtime technique which transparently enables parallel programs to tune their memory performance on distributed shared memory multiprocessors, with feedback obtained ... -
A framework for integrating data alignment, distribution, and redistribution in distributed memory multiprocessors
(2001-04)
Article
Accés restringit per política de l'editorialParallel architectures with physically distributed memory provide a cost-effective scalability to solve many large scale scientific problems. However, these systems are very difficult to program and tune. In these systems, ... -
A library implementation of the nano-threads programming model
(Springer, 1996)
Text en actes de congrés
Accés obertIn this paper we describe the design and implementation of a user-level thread package based on the nano-threads programming model, whose goal is to efficiently manage the application parallelism at user-level. Nano-thread ... -
A methodology approach to compare performance of parallel programming models for shared-memory architectures
(Springer, 2020)
Capítol de llibre
Accés obertThe majority of current HPC applications are composed of complex and irregular data structures that involve techniques such as linear algebra, graph algorithms, and resource management, for which new platforms with varying ... -
A novel asynchronous software cache implementation for the Cell-BE processor
(2008-10)
Article
Accés restringit per política de l'editorialThis paper describes the implementation of a runtime library for asynchronous communication in the Cell BE processor. The runtime library implementation provides with several services that allow the compiler to generate ... -
A parallel coupled algorithm for the solution of deformable two-body contact problem
(Barcelona Supercomputing Center, 2015-05-05)
Text en actes de congrés
Accés obertThis work presents a parallel iterative method for numerical solving frictionless contact problem for two elastic bodies. Each iterative step consists of a Dirichlet problem for the one body and a Neumann problem for ... -
A Parallel Monte Carlo Algorithm for Solving the Scattering Problem in Plasmonic Nanoparticles
(Universitat Politècnica de Catalunya, 2019-10-25)
Projecte Final de Màster Oficial
Accés obertWe propose an extension of the Ulam-Neumann algorithm for solving system of equations arising from photonic problems. This method has good parallel properties and allows to implement acceleration techniques. -
A programming model for hybrid workflows: combining task-based workflows and dataflows all-in-one
(Elsevier, 2020-12)
Article
Accés obertIn the past years, e-Science applications have evolved from large-scale simulations executed in a single cluster to more complex workflows where these simulations are combined with High-Performance Data Analytics (HPDA). ... -
A proposal for task parallelism in OpenMP
(2007-06)
Article
Accés restringit per política de l'editorialThis paper presents a novel proposal to define task parallelism in OpenMP. Task parallelism has been lacking in the OpenMP language for a number of years already. As we show, this makes certain kinds of applications difficult ... -
A proposal for task-generating loops in OpenMP
(Springer, 2013)
Text en actes de congrés
Accés restringit per política de l'editorialWith the addition of the OpenMP* tasking model, programmers are able to improve and extend the parallelization opportunities of their codes. Programmers can also distribute the creation of tasks using a worksharing construct, ... -
A proposal to extend the OpenMP tasking model with dependent tasks
(2009)
Article
Accés restringit per política de l'editorialTasking in OpenMP 3.0 has been conceived to handle the dynamic generation of unstructured parallelism. New directives have been added allowing the user to identify units of independent work (tasks) and to define points to ... -
A toolchain to verify the parallelization of OmpSs-2 applications
(Springer, 2020)
Text en actes de congrés
Accés obertProgramming models for task-based parallelization based on compile-time directives are very effective at uncovering the parallelism available in HPC applications. Despite that, the process of correctly annotating complex ... -
A trace-scaling agent for parallel application tracing
(Institute of Electrical and Electronics Engineers (IEEE), 2002)
Text en actes de congrés
Accés obertTracing and performance analysis tools are an important component in the development of high performance applications. Tracing parallel programs with current tracing tools, however, easily leads to large trace files with ... -
Accelerated execution via eager-release of dependencies in task-based workflows
(Sage, 2021-07-01)
Article
Accés obertTask-based programming models offer a flexible way to express the unstructured parallelism patterns of nowadays complex applications. This expressive capability is required to achieve maximum possible performance for ... -
Acceleration of the Geostatistical Software Library (GSLIB) by code optimization and hybrid parallel programming
(2015-12-01)
Article
Accés obertThe Geostatistical Software Library (GSLIB) has been used in the geostatistical community for more than thirty years. It was designed as a bundle of sequential Fortran codes, and today it is still in use by many practitioners ... -
Adapting the smart python agent development environment for parallel computing
(Universitat Politècnica de Catalunya, 2019-04-15)
Projecte Final de Màster Oficial
Accés obert
Realitzat a/amb: Universitat Rovira i Virgili / Universitat de Barcelona -
Adaptive MapReduce scheduling in shared environments
(Institute of Electrical and Electronics Engineers (IEEE), 2014)
Text en actes de congrés
Accés obertIn this paper we present a MapReduce task scheduler for shared environments in which MapReduce is executed along with other resource-consuming workloads, such as transactional applications. All workloads may potentially ... -
Adding tightly-integrated task scheduling acceleration to a RISC-V multi-core processor
(Association for Computing Machinery (ACM), 2019)
Text en actes de congrés
Accés obertTask Parallelism is a parallel programming model that provides code annotation constructs to outline tasks and describe how their pointer parameters are accessed so that they might be executed in parallel, and asynchronously, ... -
Advanced pattern based memory controller for FPGA based HPC applications
(Institute of Electrical and Electronics Engineers (IEEE), 2014)
Text en actes de congrés
Accés restringit per política de l'editorialThe ever-increasing complexity of high-performance computing applications limits performance due to memory constraints in FPGAs. To address this issue, we propose the Advanced Pattern based Memory Controller (APMC), which ...