Automatic communication coalescing for irregular computations in UPC language
Document typeConference report
Rights accessRestricted access - publisher's policy
Partitioned Global Address Space (PGAS) languages appeared to address programmer productivity in large scale parallel machines. However, fine grain accesses on shared structures have been identified as one of the main bottlenecks of PGAS languages. Manual or compiler assistance code optimization is required to avoid fine grain accesses. The downside of manually applying code transformations is the increased program complexity and hindering of the programmer productivity. On the other hand, compiler optimizations of fine grain accesses require knowledge of physical data mapping and the use of parallel loop constructs. This paper presents an optimization for prefetching and coalescing of shared accesses at runtime. Larger messages decrease the impact of remote access latency and increase the efficiency of the network communication. We have implemented our optimization for the Unified Parallel C (UPC) language. An experimental evaluation on a distributed-memory environment using a Power7 cluster demonstrates the benefits of our optimization.
CitationAlvanos, M. [et al.]. Automatic communication coalescing for irregular computations in UPC language. A: Conference of the Center for Advanced Studies on Collaborative Research. "Proceedings of CASCON 2012: Hilton Suites Toronto/Markham Conference Centre & Spa, Markham, Ontario, Canada: 5-7 November 2012". Toronto: IBM, 2012, p. 220-234.
|Automatic commu ... ations in UPC language.pdf||Automatic communication coalescing for irregular computations in UPC language.pdf||502.8Kb||Restricted access|