Automatic data distribution for massively parallel processors

García Almiñana, Jordi

doi:10.5821/dissertation-2117-93296

dc.contributor	Ayguadé Parra, Eduard
dc.contributor.author	García Almiñana, Jordi
dc.contributor.other	Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned	2011-04-12T15:02:31Z
dc.date.available	2005-02-07
dc.date.issued	1997-04-16
dc.date.submitted	2005-02-07
dc.identifier.citation	García Almiñana, J. Automatic data distribution for massively parallel processors. Doctoral thesis, UPC, Departament d'Arquitectura de Computadors, 1997. ISBN 8468909629. DOI 10.5821/dissertation-2117-93296.
dc.identifier.isbn	8468909629
dc.identifier.other	http://www.tdx.cat/TDX-0207105-110918
dc.identifier.uri	http://hdl.handle.net/2117/93296
dc.description.abstract	Massively Parallel Processor systems provide the computational power required to solve most large-scale High Performance Computing applications. Machines with physically distributed memory offer a cost-effective way to achieve this performance; however, these systems are very difficult to program and tune. In a distributed-memory organization each processor has direct access to its local memory and indirect access to the remote memories of other processors. However, accessing a local memory location can be more than one order of magnitude faster than accessing a remote one. In these systems, the choice of a good data distribution strategy can dramatically improve performance, although different parts of the data distribution problem have been proved to be NP-complete.
The selection of an optimal data placement depends on the program structure, the program's data sizes, the compiler capabilities, and some characteristics of the target machine. In addition, there is often a trade-off between minimizing interprocessor data movement and balancing the load across processors. Automatic data distribution tools can assist the programmer in selecting a good data layout strategy. These are usually source-to-source tools that annotate the original program with data distribution directives.
Crucial aspects such as data movement, parallelism, and load balance have to be considered in a unified way to solve the data distribution problem efficiently.
In this thesis a framework for automatic data distribution is presented, in the context of a parallelizing environment for massively parallel processor (MPP) systems. The applications considered for parallelization are usually regular problems, in which data structures are dense arrays. The data mapping strategy generated is optimal for a given problem size and target MPP architecture, according to our current cost and compilation model.
A single data structure, named the Communication-Parallelism Graph (CPG), which holds symbolic information about the data movement and parallelism inherent in the whole program, is the core of our approach. This data structure allows the data movement and parallelism effects of any data distribution strategy supported by our model to be estimated. Assuming that some program characteristics have been obtained by profiling and that some specific target machine features have been provided, the symbolic information in the CPG can be replaced by constant values, expressed in seconds, representing the time overhead of data movement and the time saved by parallelization. The CPG is then used to model a minimal path problem, which is solved by a general-purpose linear 0-1 integer programming solver. Linear programming techniques guarantee that the solution provided is optimal, and they solve this kind of problem very efficiently.
The data mapping capabilities provided by the tool include alignment of the arrays, one- or two-dimensional distribution in BLOCK or CYCLIC fashion, a set of remapping actions to be performed between phases if profitable, and the associated parallelization strategy.
The effects of control flow statements between phases are taken into account in order to improve the accuracy of the model. The novelty of the approach resides in handling all stages of the data distribution problem, which traditionally have been treated in several independent phases, in a single step, and in providing an optimal solution according to our model.
dc.language.iso	eng
dc.publisher	Universitat Politècnica de Catalunya
dc.rights	WARNING. Access to the contents of this doctoral thesis and its use must respect the rights of the author. It may be used for consultation or personal study, as well as in research and teaching activities or materials, under the terms established in Article 32 of the Consolidated Text of the Intellectual Property Law (RDL 1/1996). Any other use requires the prior and express authorization of the author. In any case, any use of its contents must clearly state the full name of the author and the title of the doctoral thesis. Reproduction or other forms of exploitation for profit are not authorized, nor is public communication from a site other than the TDX service. Presenting its content in a window or frame external to TDX (framing) is also not authorized. This reservation of rights applies both to the contents of the thesis and to its abstracts and indexes.
dc.source	TDX (Tesis Doctorals en Xarxa)
dc.subject	UPC subject areas::Computer science
dc.subject.other	massively parallel processors
dc.subject.other	high performance fortran
dc.subject.other	automatic data recomposition
dc.subject.other	multicomputers
dc.subject.other	automatic data distribution
dc.subject.other	automatic data mapping
dc.subject.other	3304. Computer technology
dc.title	Automatic data distribution for massively parallel processors
dc.type	Doctoral thesis
dc.subject.lemac	Parallel processing (Computers)
dc.identifier.doi	10.5821/dissertation-2117-93296
dc.identifier.dl	B.15683-2005
dc.rights.access	Open Access
dc.description.version	Postprint (published version)
dc.identifier.tdx	http://hdl.handle.net/10803/5981

Files in this item

Name: 01Jga01de01.pdf
Size: 919.4 KB
Format: PDF

This item appears in the following collections

Departament d'Arquitectura de Computadors [361]
All theses [5,461]

UPCommons. Open knowledge portal of the UPC