Buffer sizing for self-timed stream programs on heterogeneous distributed memory multiprocessors

Carpenter, Paul Matthew; Ramírez Bellido, Alejandro; Ayguadé Parra, Eduard

doi:10.1007/978-3-642-11515-8_9

Visualitza/Obre

hipeac_ramirez_10.pdf (353,3Kb) (Accés restringit) Sol·licita una còpia a l'autor

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Carpenter, Paul Matthew

Ramírez Bellido, Alejandro

Ayguadé Parra, Eduard

Tipus de documentText en actes de congrés

Data publicació2010

EditorSpringer Verlag

Condicions d'accésAccés restringit per política de l'editorial

Attribution-NonCommercial-NoDerivs 3.0 Spain

Llevat que s'hi indiqui el contrari, els continguts d'aquesta obra estan subjectes a la llicència de Creative Commons : Reconeixement-NoComercial-SenseObraDerivada 3.0 Espanya

Abstract

Stream programming is a promising way to expose concurrency to the compiler. A stream program is built from kernels that communicate only via point-to-point streams. The stream compiler statically allocates these kernels to processors, applying blocking, fission and fusion transformations. The compiler determines the sizes of the communication buffers, which affects performance since local memories can be small. In this paper, we propose a feedback-directed algorithm that determines the size of each communication buffer, based on i) the stream program that has been mapped onto processors, ii) feedback from an earlier execution, and iii) the memory constraints. The algorithm exposes a trade-off between throughput and latency. It is general, in that it applies to stream programs with unstructured stream graphs, and it supports variable execution times and communication rates. We show results for the StreamIt benchmarks and random graphs. For the StreamIt benchmarks, throughput is optimal after the first iteration. For random graphs with stochastic computation times, throughput is within 3% of optimal after four iterations. Compared with the previous general algorithm, by Basten and Hoogerbrugge, our algorithm has significantly better performance and latency.

CitacióCarpenter, P.; Alex Ramirez; Ayguade, E. Buffer sizing for self-timed stream programs on heterogeneous distributed memory multiprocessors. A: International Conference on High Performance Embedded Architectures & Compilers (HiPEAC). "HiPEAC 2010 International conference on High-Performance Embedded Architectures and Compilers". Pisa: Springer Verlag, 2010, p. 96-110.

URIhttp://hdl.handle.net/2117/11345

DOI10.1007/978-3-642-11515-8_9

ISBN978-3-642-11515-8

Versió de l'editorhttp://www.springerlink.com/content/0g143884xj21n085/fulltext.pdf

Col·leccions

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
hipeac_ramirez_10.pdf		353,3Kb	PDF	Accés restringit

UPCommons. Portal del coneixement obert de la UPC

Buffer sizing for self-timed stream programs on heterogeneous distributed memory multiprocessors

Visualitza/Obre

Explora